Abstract. We deal with a nonlocal interaction equation describing the evolution of a particle
density under the effect of a general symmetric pairwise interaction potential, not necessarily
in convolution form. We describe the case of a convex (or $\lambda$-convex) potential, possibly
not smooth at several points, generalizing the results of [CDFLS]. We also identify the cases
in which the dynamic is still governed by the continuity equation with well-characterized
nonlocal velocity field.

1. Introduction 

Let us consider a distribution of particles, represented by a Borel probability measure $\mu$ on
$\mathbb{R}^d$. We introduce the interaction potential $W : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$. The value $W(x, y)$ describes
the interaction of two particles of unit mass at the positions $x$ and $y$. The total energy of a
distribution $\mu$ under the effect of the potential is given by the interaction energy functional,
defined by

(1.1) $\mathcal{W}(\mu) := \frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} W(x, y)\, d(\mu \times \mu)(x, y).$

We assume that W satisfies the following assumptions: 

i) $W : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$ is symmetric, i.e.

(1.2) $W(x, y) = W(y, x)$ for every $x, y \in \mathbb{R}^d$;

ii) $W$ is a $\lambda$-convex function for some $\lambda \in \mathbb{R}$, i.e.

(1.3) the function $(x, y) \mapsto W(x, y) - \frac{\lambda}{2}(|x|^2 + |y|^2)$ is convex;

iii) $W$ satisfies the quadratic growth condition at infinity, i.e. there exists a constant
$C > 0$ such that

(1.4) $W(x, y) \le C(1 + |x|^2 + |y|^2)$.

We are interested in the evolution problem given by the continuity equation 

(1.5) $\partial_t \mu(t) + \mathrm{div}\,(v(t)\mu(t)) = 0, \qquad \mu(0) = \mu_0,$

describing the dynamics of the particle density, whose total mass is conserved, under the
mutual attraction-repulsion force given by (1.1). The velocity vector field $v$ enjoys a nonlocal
dependence on $\mu$. In the basic model represented by a $C^1$ potential $W$ which depends only
on the difference of its variables, so that we may write $W(x, y) = W(x - y)$, it is given by
convolution:

(1.6) $v(t) = -\nabla W * \mu(t).$



2010 Mathematics Subject Classification. 49K20, 35F99. 




Under the assumptions (1.2), (1.3), (1.4), in general the function $W$ is not differentiable but
only subdifferentiable, therefore it is reasonable to consider a velocity field of the form

(1.7) $v(t) = -\eta(t) * \mu(t),$

where $\eta$ represents a Borel measurable selection in the subdifferential of $W$, and we will write
$\eta \in \partial W$. In general, such a selection is not independent of $t$. We stress that $x \mapsto \eta(x)$ needs to
be a pointwise defined object, since the solutions we consider are probability measures, and
since this model typically presents concentration phenomena when starting with absolutely
continuous initial data.

In this paper, we are going to analyse equations of the form (1.5)-(1.7) as the gradient flow
of the interaction energy (1.1) in the space of Borel probability measures with finite second
moment, endowed with the metric-differential structure induced by the so-called Wasserstein
distance. This interpretation coming from the optimal transport theory was introduced in
[O] for nonlinear diffusion equations and generalized for a large class of functionals including
potential, interaction, and internal energy by different authors [CMV1, AGS, CMV2]; see
[V2] for related information.

The gradient flow interpretation allows one to construct solutions by means of variational
schemes based on the euclidean optimal transport distance, as originally introduced in [JKO]
for the linear Fokker-Planck equation. The convergence of these variational schemes for general
functionals was detailed in [AGS]. The results in this monograph, which are quickly
summarized in Section 2, apply to the interaction equation (1.5)-(1.6) with a $C^1$ smooth
potential verifying the convexity assumption (1.3) and a growth condition at infinity stricter
than (1.4).

On the other hand, these equations have appeared in the literature as simple models of
inelastic interactions [MY, BCP, BCCP, T] in which the asymptotic behavior of the equations
is given by a total concentration towards a unique Dirac delta point measure. The typical
potential in these models was a power law, $W(x, y) = |x - y|^\alpha$, $\alpha > 0$. Moreover, it was
noticed in [LT] that the convergence towards this unique steady state occurs in finite time for
a certain range of exponents in the one dimensional case.

These equations also appear in very simplified swarming or population dynamics models
for collective motion of individuals, see [MEBS, BL, BCL, KSUB, BCLR] and the references
therein. The interaction potential models the long-range attraction and the short-range
repulsion typical in animal groups. In case the potential is fully attractive, equation (1.5)
is usually referred to as the aggregation equation. For the aggregation equation, finite time
blow-up results for weak-$L^p$ solutions, unique up to the blow-up time, have been obtained
in the literature [BCL, BLR, CR]. In fact, those results conjectured that solutions tend to
concentrate and form Dirac deltas in finite time under suitable conditions on the interaction
potential. On the other hand, the confinement of particles is shown to happen for short-range
repulsive long-range attractive potentials under certain conditions [CDFLS2]. Some singular
stationary states such as uniform densities on spheres have been identified as stable/unstable
for radial perturbations in [BCLR] with sharp conditions on the potential. Finally, in the one
dimensional case, stationary states formed by a finite number of particles and smooth stationary
profiles are found, whose stability has been studied in [FR1, FR2] in a suitable sense.

A global-in-time well-posedness theory of measure weak solutions has been developed in
[CDFLS] for interaction potentials of the form $W(x, y) = W(x - y)$ satisfying the assumptions
(1.2), (1.3), (1.4), and additionally being $C^1$-smooth except possibly at the origin. The
convexity condition (1.3) restricts the possible singularities of the potential at the origin since
it implies that $W$ is Lipschitz, and therefore the possible singularity cannot be worse than $|x|$
locally at the origin. Nevertheless, for a class of potentials in which the local behavior at the
origin is like $|x|^\alpha$, $1 < \alpha < 2$, the solutions converge towards a Dirac delta with the full mass
at the center of mass of the solution. The condition for blow-up is more general and related
to the Osgood criterion for uniqueness of ODEs [BCL, CDFLS, BLR]. Note that the center
of mass of the solution is preserved, at least formally, due to the symmetry assumption (1.2).

In this work, we push the ideas started in [CDFLS] further in the direction of giving conditions
on the interaction potential to have a global-in-time well-posedness theory of measure
solutions. The solutions constructed in Section 2 will be gradient flow solutions, as in [AGS],
built via the variational schemes based on the optimal transport Wasserstein distance. The
crucial point for the analysis in this framework is the identification of the velocity field in
the continuity equation satisfied by the limiting curve of measures from the approximating
variational scheme. In order to identify it, we need to characterize the subdifferential
of the functional defined in (1.1) with respect to the differential structure induced by the
Wasserstein metric. The Wasserstein subdifferential of the functional $\mathcal{W}$, which is rigorously
introduced in Section 2, is defined through variations along transport maps. It turns out
that the element of minimal norm in this subdifferential, which will be denoted by $\partial^\circ \mathcal{W}(\cdot)$,
is the element that governs the dynamics. Actually, it gives the velocity field via the relation
$v(t) = -\partial^\circ \mathcal{W}(\mu(t))$ for a.e. $t \in (0, \infty)$, which corresponds to the notion of gradient flow
solution. This notion will be discussed in Section 2, where we will give the precise definition
and recall from [AGS, Chapter 11] the main properties, such as the semigroup generation.

In Section 3 we give a characterization of the subdifferential in the general case of the
interaction potential $W(x, y)$ satisfying only the basic assumptions (1.2), (1.3), and (1.4). However,
the element of minimal norm in the subdifferential is not fully identified and cannot be
universally characterized. Nevertheless, the global well-posedness of the evolution semigroup in
measures is obtained.

A distinguished role will be played by the case of a kernel function $W(x, y)$ which depends
only on the difference $x - y$ of its arguments. Hence we will often consider one of the following
additional assumptions.

iv) There exists $W : \mathbb{R}^d \to \mathbb{R}$ such that

(1.8) $W(x, y) = W(x - y)$.

v) There exists $w : \mathbb{R} \to \mathbb{R}$ such that

(1.9) $W(x, y) = W(x - y) = w(|x - y|)$.

The radial hypothesis is frequently made in models, and corresponds to an interaction between
particles which depends only on their mutual distance. In case $W(x, y)$ is also radial
and convex, we can fully generalize the identification of the element of minimal norm in
the subdifferential of the interaction energy done in [CDFLS], regardless of the number of
nondifferentiability points of $W$. We complement our results with explicit examples showing
the sharpness of these characterizations.

Before stating the results, and in order to fix notation, we recall the characterization of the
subdifferential for $\lambda$-convex functions. Given a $\lambda$-convex function $V : \mathbb{R}^k \to \mathbb{R}$, a vector $\xi$
belongs to the subdifferential of $V$ at the point $x \in \mathbb{R}^k$ if and only if

(1.10) $V(z) - V(x) \ge \langle \xi, z - x \rangle + \frac{\lambda}{2} |z - x|^2 \qquad \forall z \in \mathbb{R}^k,$

and we write $\xi \in \partial V(x)$. In this case, for every $x \in \mathbb{R}^k$, we have that $\partial V(x)$ is a nonempty
closed convex subset of $\mathbb{R}^k$. We denote by $\partial^\circ V(x)$ the unique element of minimal euclidean
norm in $\partial V(x)$.
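To make the objects $\partial V(x)$ and $\partial^\circ V(x)$ concrete, the following is a minimal numerical sketch in dimension $k = 1$, where the subdifferential of a convex function is the interval between its left and right derivatives and the minimal norm element is the projection of $0$ onto that interval. The test function $V(x) = |x - 1| + |x + 1|$, the finite-difference step `h`, and all function names are illustrative assumptions, not taken from the paper; the one-sided derivatives are only approximated, which is accurate here because $V$ is piecewise linear.

```python
import numpy as np

def one_sided_derivatives(V, x, h=1e-6):
    """Finite-difference approximations of the left and right derivatives of V at x."""
    left = (V(x) - V(x - h)) / h
    right = (V(x + h) - V(x)) / h
    return left, right

def minimal_norm_element(V, x, h=1e-6):
    """Projection of 0 onto the (approximate) interval [V'_-(x), V'_+(x)]."""
    left, right = one_sided_derivatives(V, x, h)
    return float(np.clip(0.0, left, right))

def satisfies_subgradient_inequality(V, x, xi, zs, lam=0.0, tol=1e-9):
    """Check inequality (1.10) at the sample points zs (a necessary check only)."""
    zs = np.asarray(zs, dtype=float)
    lhs = V(zs) - V(x)
    rhs = xi * (zs - x) + 0.5 * lam * (zs - x) ** 2
    return bool(np.all(lhs >= rhs - tol))

# Hypothetical convex example with two kinks, at x = -1 and x = 1.
V = lambda x: np.abs(x - 1.0) + np.abs(x + 1.0)
```

At the kink $x = 1$ the interval is approximately $[0, 2]$, so the sketch returns a value close to $0$, while at $x = 2$ the function is differentiable and the value is close to $2$.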

The main results. Let us give a brief summary of the results contained in this paper. The 
main theorem deals with radial-convex potentials and reads as follows. 

Let $W$ satisfy the three basic assumptions above: (1.2), (1.3), and (1.4). If in addition
$W$ satisfies (1.8), (1.9) and is convex (that is, $\lambda \ge 0$ in (1.3)), then there exists a unique
gradient flow solution to the equation

(1.11) $\partial_t \mu(t) - \mathrm{div}\,((\partial^\circ W * \mu(t))\mu(t)) = 0.$

This solution is the gradient flow of the energy $\mathcal{W}$, in the sense that the velocity field in
(1.11) satisfies

$\partial^\circ W * \mu(t) = \partial^\circ \mathcal{W}(\mu(t)).$

On the other hand, when omitting the radial hypothesis (1.9), or when letting the potential
be $\lambda$-convex but not convex, we show that the evolution of the system under the effect of
the potential, that is the gradient flow of $\mathcal{W}$, is characterized by (1.5)-(1.7), where $\eta(t) \in \partial W$
is a Borel anti-symmetric selection for almost every $t$. The corresponding rigorous
statement is found in Section 2.

About this last result, let us remark that the velocity vector field is still written in terms of
a suitable selection $\eta$ in the local subdifferential of $W$, but such a selection $\eta$ is not in general
the minimal one in $\partial W$, and it is not a priori independent of $t$. By this characterization we
also recover the result of [CDFLS], where the only non-smoothness point for $W$ is the origin:
in such a case, for any $t$ we are left with $\eta(t)(x) = \nabla W(x)$ for $x \ne 0$ and $\eta(t)(0) = 0$ for $x = 0$,
by anti-symmetry. We stress that, due to the nonlocal structure of the problem, the task of
identifying the velocity vector field becomes much more involved when $W$ has several
non-smoothness points, even if it is $\lambda$-convex. Later we will analyse some particular
examples, showing that in general it is not possible to write the velocity field in terms of a
single selection in $\partial W$.

Finally, when omitting also the assumption (1.8), we break the convolution structure: in
this more general case we show that the velocity is given in terms of an element of $\partial_1 W$, or
equivalently of $\partial_2 W$ by symmetry, where $\partial_1 W$ and $\partial_2 W$ denote the partial subdifferentials
of $W$ with respect to the first $d$ variables or the last $d$ variables respectively,

$v(t)(x) = -\int_{\mathbb{R}^d} \eta(t)(x, y)\, d\mu(t)(y),$

where $\eta(t) \in \partial_1 W$ for almost any $t$. An additional joint subdifferential condition is also
present in this case; for the rigorous statement we again refer to the next section.

Pointwise particle model and asymptotic behavior. In the model case of a system of
$N$ point particles, discussed in Section 5, the dynamics are governed by a system of ordinary
differential equations. In this case equation (1.5)-(1.7) corresponds to

$\frac{dx_i}{dt} = \sum_{j} m_j\, \eta(t)(x_j - x_i), \qquad i = 1, \dots, N,$

where $x_i(t)$ is the position of the $i$-th particle and $m_i$ is its mass. It is shown in [CDFLS]
that if the attractive strength of the potential is sufficiently high, all the particles collapse
to the center of mass in finite time. We will remark that this result still holds under
our hypotheses, under the same Osgood criterion as in [BCL, CDFLS] for fully attractive
potentials. For non-convex non-smooth repulsive-attractive potentials, albeit $\lambda$-convex, the
analysis leads to non-trivial sets of stationary states with singularities that cannot be treated
by the theory in [CDFLS]. Our analysis shows that a very wide range of asymptotic states is
indeed possible; we give several explicit examples.
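The particle system above can be explored numerically. The following is a minimal sketch, not the authors' method: it assumes the power-law potential $W(x) = |x|^\alpha$ with the hypothetical choice $\alpha = 1.5$, takes $\eta(z) = \alpha |z|^{\alpha - 2} z$ for $z \ne 0$ and $\eta(0) = 0$ (the anti-symmetric selection), and integrates the ODE with an explicit Euler scheme. The function names, the time step and the number of steps are assumptions made for illustration.

```python
import numpy as np

def eta(z, alpha=1.5):
    """Anti-symmetric selection for W(x) = |x|^alpha: the gradient away from 0, and 0 at 0."""
    r = np.linalg.norm(z, axis=-1, keepdims=True)
    safe_r = np.where(r > 0, r, 1.0)  # avoid 0 ** negative power
    return np.where(r > 0, alpha * safe_r ** (alpha - 2) * z, 0.0)

def velocities(x, m, alpha=1.5):
    """Right-hand side dx_i/dt = sum_j m_j * eta(x_j - x_i)."""
    diff = x[None, :, :] - x[:, None, :]  # diff[i, j] = x_j - x_i
    return np.einsum("j,ijk->ik", m, eta(diff, alpha))

def simulate(x0, m, dt=1e-3, steps=3000, alpha=1.5):
    """Explicit Euler integration of the particle system (illustrative only)."""
    x = np.array(x0, dtype=float)
    m = np.asarray(m, dtype=float)
    for _ in range(steps):
        x = x + dt * velocities(x, m, alpha)
    return x

# Two particles of mass 1/2 on the real line, starting at -1 and 1.
x0 = np.array([[-1.0], [1.0]])
m = np.array([0.5, 0.5])
x_final = simulate(x0, m)
```

Because $\eta$ is anti-symmetric, the discrete center of mass $\sum_i m_i x_i$ is preserved by each Euler step up to rounding, and in this two-particle example the particles end up within the resolution of the time step of each other, in line with the finite time collapse described in the text.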

Plan of the paper. In the following Section 2 we introduce the optimal transport framework
and the basic properties of our energy functional, in particular the subdifferentiability
and the $\lambda$-convexity along geodesics. We briefly explain what a gradient flow in the metric
space $\mathcal{P}_2(\mathbb{R}^d)$ is and we introduce the notion of gradient flow solution. We present the general
well-posedness result of [AGS] and show how it will apply to our interaction models,
stating our main results. In Section 3 we make a fine analysis of the Wasserstein subdifferential
of $\mathcal{W}$ and find a first characterization of its element of minimal norm. In Section 4 we
particularize the characterization to the case of assumption (1.8), which is the convolution
case. In particular, we have the strongest result in the case of assumption (1.9). Section 5
relates these arguments to the finite time collapse results in [CDFLS], showing the characterization
of the velocity field for particles. Section 6 gives examples of non-smooth non-convex
repulsive-attractive potentials, albeit $\lambda$-convex, leading to non-trivial sets of stationary states.
Finally, the Appendix 7 is devoted to recalling technical concepts from the differential calculus
in Wasserstein spaces which are needed in Section 3.

2. Wasserstein subdifferential and gradient flow of the interaction energy 

2.1. Optimal transport framework. We denote by $\mathcal{P}_2(\mathbb{R}^d)$ the space of Borel probability
measures over $\mathbb{R}^d$ with finite second moment, i.e., the set of Borel probability measures $\mu$
such that

$\int_{\mathbb{R}^d} |x|^2\, d\mu(x) < \infty.$

The convergence of probability measures is considered in the narrow sense, defined as the
weak convergence in the duality with continuous and bounded functions over $\mathbb{R}^d$. Given
$\mu, \nu \in \mathcal{P}_2(\mathbb{R}^d)$ and $\gamma \in \Gamma(\mu, \nu)$, where

$\Gamma(\mu, \nu) := \{\gamma \in \mathcal{P}_2(\mathbb{R}^d \times \mathbb{R}^d) : \gamma(\Omega \times \mathbb{R}^d) = \mu(\Omega),\ \gamma(\mathbb{R}^d \times \Omega) = \nu(\Omega)$

for every Borel set $\Omega \subset \mathbb{R}^d\}$,

the euclidean quadratic transport cost between $\mu$ and $\nu$ with respect to the transport plan $\gamma$
is defined by

$C(\mu, \nu; \gamma) = \left( \int_{\mathbb{R}^d \times \mathbb{R}^d} |x - y|^2\, d\gamma(x, y) \right)^{1/2}.$

The "Wasserstein distance" between $\mu$ and $\nu$ is defined by

(2.1) $d_W(\mu, \nu) = \inf_{\gamma \in \Gamma(\mu, \nu)} C(\mu, \nu; \gamma).$

It is well known that the inf in (2.1) is attained by a minimizer. The minimizers in (2.1) are
called optimal plans. We denote by $\Gamma_o(\mu, \nu) \subset \Gamma(\mu, \nu)$ the set of optimal plans between $\mu$ and
$\nu$. It is also well known that $\mu_n \to \mu$ in $\mathcal{P}_2(\mathbb{R}^d)$ (i.e. $d_W(\mu_n, \mu) \to 0$) if and only if $\mu_n$ narrowly
converges to $\mu$ and $\int_{\mathbb{R}^d} |x|^2\, d\mu_n(x) \to \int_{\mathbb{R}^d} |x|^2\, d\mu(x)$. The space $\mathcal{P}_2(\mathbb{R}^d)$ endowed with the
distance $d_W$ is a complete and separable metric space. For all the details on Wasserstein
distance and optimal transportation, we refer to [AGS, V2].

We recall the push forward notation for a map $s : (\mathbb{R}^d)^m \to (\mathbb{R}^d)^k$, $m, k \ge 1$, and a measure
$\mu \in \mathcal{P}((\mathbb{R}^d)^m)$: the measure $s_\# \mu \in \mathcal{P}((\mathbb{R}^d)^k)$ is defined by $s_\# \mu(A) = \mu(s^{-1}(A))$, where $A$ is
a Borel set. A transport plan $\gamma \in \Gamma(\mu, \nu)$ may be induced by a map $s : \mathbb{R}^d \to \mathbb{R}^d$ such that
$s_\# \mu = \nu$. This means that $\gamma = (i, s)_\# \mu$, where $i : \mathbb{R}^d \to \mathbb{R}^d$ denotes the identity map over
$\mathbb{R}^d$ and $(i, s) : \mathbb{R}^d \to \mathbb{R}^d \times \mathbb{R}^d$ is the product map. Finally, $\pi^j$ will stand for the projection
map on the $j$-th component of a product space. Hence, if $\gamma$ is a probability measure over a
product space (for instance a transport plan), $\pi^j_\# \gamma$ is its $j$-th marginal.
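As an illustration of the distance (2.1) and of plans induced by maps, consider two empirical measures on the real line with the same number $n$ of equally weighted atoms. In this special case it is a classical fact that the monotone rearrangement (matching sorted atoms) is an optimal transport map for the quadratic cost. The sketch below, with hypothetical function names, computes $d_W$ this way and compares it with a brute-force minimum of the cost $C(\mu, \nu; \gamma)$ over plans induced by permutations; it is restricted to $d = 1$ and small $n$.

```python
import itertools
import numpy as np

def wasserstein2_1d(x, y):
    """d_W between two empirical measures on R with n equally weighted atoms each,
    computed through the monotone (sorted) matching."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    if x.shape != y.shape:
        raise ValueError("both samples must have the same number of atoms")
    return float(np.sqrt(np.mean((x - y) ** 2)))

def transport_cost(x, y, perm):
    """Cost C(mu, nu; gamma) of the plan sending atom x[i] to atom y[perm[i]]."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.sqrt(np.mean((x - y[list(perm)]) ** 2)))

def brute_force_distance(x, y):
    """Minimum cost over all permutation plans (feasible only for small n)."""
    n = len(x)
    return min(transport_cost(x, y, p) for p in itertools.permutations(range(n)))

x = [0.3, -1.0, 2.0, 0.9]
y = [1.5, -0.2, 0.4, 3.1]
```

For a pure translation $y_i = x_i + a$ the sorted matching sends each atom to its translate, so the sketch returns $|a|$.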

The first properties of the interaction potential functional $\mathcal{W}$ given by (1.1) are contained
in the next Proposition.

Proposition 2.1. Under assumptions (1.3) and (1.4), the functional $\mathcal{W}$ is lower semicontinuous
with respect to the $d_W$ metric and enjoys the following $\lambda$-convexity property: for every
$\mu, \nu \in \mathcal{P}_2(\mathbb{R}^d)$ and every $\gamma \in \Gamma(\mu, \nu)$ it holds

(2.2) $\mathcal{W}(\theta_\gamma(t)) \le (1 - t)\mathcal{W}(\mu) + t\,\mathcal{W}(\nu) - \frac{\lambda}{2} t(1 - t)\, C^2(\mu, \nu; \gamma),$

where $\theta_\gamma$ denotes the interpolating curve $t \in [0, 1] \mapsto \theta_\gamma(t) = ((1 - t)\pi^1 + t\pi^2)_\# \gamma \in \mathcal{P}_2(\mathbb{R}^d)$.

The lower semicontinuity follows from standard arguments; for the convexity along interpolating
curves we refer to [AGS, §9.3]. In particular, since every constant speed Wasserstein
geodesic is of the form $\theta_\gamma$, where $\gamma$ is an optimal plan, $\mathcal{W}$ is $\lambda$-convex along every Wasserstein
geodesic. We adapt from [AGS] the definition of the Wasserstein subdifferential for the
$\lambda$-convex functional $\mathcal{W}$.

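Definition 2.2 (The Wasserstein subdifferential of $\mathcal{W}$). Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. We say that
the vector field $\xi \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ belongs to $\partial \mathcal{W}(\mu)$, the Wasserstein subdifferential of the
$\lambda$-convex functional $\mathcal{W} : \mathcal{P}_2(\mathbb{R}^d) \to (-\infty, +\infty]$ at the point $\mu$, if for every $\nu \in \mathcal{P}_2(\mathbb{R}^d)$ there
exists $\gamma \in \Gamma_o(\mu, \nu)$ such that

(2.3) $\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \xi(x), y - x \rangle\, d\gamma(x, y) + \frac{\lambda}{2} C^2(\mu, \nu; \gamma).$

We say that $\xi \in \partial_S \mathcal{W}(\mu)$, the strong subdifferential of $\mathcal{W}$ at the point $\mu$, if for every
$\nu \in \mathcal{P}_2(\mathbb{R}^d)$ and for every admissible plan $\gamma \in \Gamma(\mu, \nu)$, (2.3) holds.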

The metric slope of the functional $\mathcal{W}$ at the point $\mu$ is defined as follows:

$|\partial \mathcal{W}|(\mu) := \limsup_{\nu \to \mu} \frac{(\mathcal{W}(\mu) - \mathcal{W}(\nu))^+}{d_W(\nu, \mu)},$

where $(a)^+$ denotes the positive part of the real number $a$. Since $\mathcal{W}$ is $\lambda$-convex we know that

(2.4) $|\partial \mathcal{W}|(\mu) = \min\{\|\xi\|_{L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)} : \xi \in \partial \mathcal{W}(\mu)\}.$

Moreover, the element realizing the minimal norm in (2.4) is unique and we denote it by
$\partial^\circ \mathcal{W}(\mu)$ (see [AGS, Chapter 10]). The element of minimal norm in the subdifferential plays
a crucial role, since it is known to be the velocity vector field of the evolution equation
associated to the gradient flow of the functional (1.1) under certain conditions, as reviewed
next.

2.2. The gradient flow solution. As already shown in [CDFLS], we are forced to consider
a notion of solution which only assumes that the densities are, in fact, measures. Actually, in
case of attractive radial potentials verifying assumptions (1.2), (1.3) and (1.4), i.e. $W(x, y) =
w(|x - y|)$ with $w$ increasing, it was shown in [BCL, BLR] that weak-$L^p$ solutions blow
up in finite time. Moreover, these weak-$L^p$ solutions can be uniquely continued as measure
solutions, as proved in [CDFLS], leading to a total collapse in a single Dirac delta at the
center of mass in finite time. Furthermore, particle solutions, i.e., solutions corresponding to
an initial datum composed of a finite number of atoms, remain particle solutions for all times
for the evolution of (1.5). Summarizing, we can only expect that a regular solution enjoys
local in time existence.

Our well-posedness results are based on the following abstract theorem for gradient flow
solutions. For all the details we refer to [AGS, Theorem 11.2.1], where some more properties
of these solutions are remarked.

Before stating the Theorem, we say that a curve $t \in [0, \infty) \mapsto \mu(t) \in \mathcal{P}_2(\mathbb{R}^d)$ is locally
absolutely continuous with locally finite energy, and we denote it by $\mu \in AC^2_{loc}([0, \infty); \mathcal{P}_2(\mathbb{R}^d))$,
if the restriction of $\mu$ to the interval $[0, T]$ is absolutely continuous for every $T > 0$ and its
metric derivative, which exists for a.e. $t > 0$, defined by

$|\mu'|(t) := \lim_{s \to t} \frac{d_W(\mu(s), \mu(t))}{|t - s|},$

belongs to $L^2(0, T)$ for every $T > 0$.

Theorem 2.3. Let $W$ satisfy the hypotheses (1.2), (1.3) and (1.4). For any initial datum
$\mu_0 \in \mathcal{P}_2(\mathbb{R}^d)$, there exists a unique curve $\mu \in AC^2_{loc}([0, \infty); \mathcal{P}_2(\mathbb{R}^d))$ satisfying

$\partial_t \mu(t) + \mathrm{div}\,(v(t)\mu(t)) = 0$ in $\mathcal{D}'((0, \infty) \times \mathbb{R}^d)$,
$v(t) = -\partial^\circ \mathcal{W}(\mu(t))$ for a.e. $t > 0$,
$\|v(t)\|_{L^2(\mu(t))} = |\mu'|(t)$ for a.e. $t > 0$,

with $\mu(0) = \mu_0$. The energy identity

$\int_a^b \int_{\mathbb{R}^d} |v(t, x)|^2\, d\mu(t)(x)\, dt + \mathcal{W}(\mu(b)) = \mathcal{W}(\mu(a))$

holds for all $0 \le a \le b < \infty$. Moreover, the solution is given by a $\lambda$-contractive semigroup
$S(t)$ acting on $\mathcal{P}_2(\mathbb{R}^d)$, that is $\mu(t) = S(t)\mu_0$ with

$d_W(S(t)\mu_0, S(t)\nu_0) \le e^{-\lambda t}\, d_W(\mu_0, \nu_0), \qquad \forall \mu_0, \nu_0 \in \mathcal{P}_2(\mathbb{R}^d).$

The unique curve given by Theorem 2.3 is called gradient flow solution for equation

(2.5) $\partial_t \mu(t) = \mathrm{div}\,(\partial^\circ \mathcal{W}(\mu(t))\, \mu(t)).$

Let us finally remark that weak measure solutions as defined in [CDFLS] are equivalent to
gradient flow solutions, as shown there.

Characterizing the element of minimal norm $\partial^\circ \mathcal{W}(\mu)$ is then essential to link the
constructed solutions to the sought equation (1.5)-(1.6) or (1.5)-(1.7). This characterization was
done in [CDFLS] for potentials satisfying (1.2), (1.3), (1.4) and (1.8), the potential $W$ being
$C^1$-smooth except possibly at the origin. Under those assumptions, the authors identified
$\partial^\circ \mathcal{W}(\mu)$ as $\partial^\circ W * \mu$, i.e.

$\partial^\circ W * \mu(x) = \int_{y \ne x} \nabla W(x - y)\, d\mu(y).$

The main results in the present work will show that this characterization can be generalized
to convex potentials satisfying assumptions (1.2), (1.4) and the radial hypothesis (1.9),
regardless of the number of points of non-differentiability of the potential $W$. In Theorem
4.5 we show that under those conditions, the formula $\partial^\circ \mathcal{W}(\mu) = \partial^\circ W * \mu$ also holds, and the
equation takes the form (1.11), which generalizes the standard form of the interaction potential
evolution (1.5)-(1.6). In the most general case, i.e. for general potentials satisfying only
(1.2), (1.3) and (1.4), we will obtain a characterization in terms of generic Borel measurable
selections in $\partial W$. Therefore, the next two sections are devoted to the study of the velocity
field $\partial^\circ \mathcal{W}$, in order to apply the abstract result above to the aggregation equation. Let us
state the results:

• Let $W$ satisfy (1.2), (1.3) and (1.4). Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. There holds

(2.6) $\partial^\circ \mathcal{W}(\mu) = \int_{\mathbb{R}^d} \eta(\cdot, y)\, d\mu(y)$

for some selection $\eta \in \partial_1 W$ having the form

(2.7) $\eta(x, y) = \frac{1}{2}\left(\eta^1(x, y) + \eta^2(y, x)\right),$

with the couple $(\eta^1, \eta^2)$ belonging to the joint subdifferential $\partial W$. This is shown in
Theorem 3.4.

• Under the additional assumption (1.8), let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. Then we have

(2.8) $\partial^\circ \mathcal{W}(\mu) = \eta * \mu$

for some $\eta \in \partial W$. This is the characterization following from Corollary 4.2.

• Finally, when the further condition (1.9) holds, and the potential is convex (not only
$\lambda$-convex for a negative $\lambda$), there is

(2.9) $\partial^\circ \mathcal{W}(\mu) = \partial^\circ W * \mu$

for all $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. Here, $\partial^\circ W$ is the element of minimal norm of the subdifferential
of $W$. This is proven in the subsequent Theorem 4.5.

Remark 2.4. Substituting (2.6), (2.8) or (2.9) in (2.5) and applying Theorem 2.3 one obtains
the corresponding well-posedness result. In a later section we will show that the selection $\eta$
appearing in (2.8) (and thus also the one in (2.6)) in general depends on $\mu$. Therefore,
in the case of (2.8), the dynamic will be governed by a velocity field of the form (1.7), where
the selection depends in general on $t$. Similarly for the case of (2.6). On the other hand, we
stress that a consequence of the last characterization (2.9), when applying Theorem 2.3, is
that the selection corresponding to the velocity $v(t)$ does not depend on $t$.

Remark 2.5. The joint subdifferential constraint (2.7) has a natural interpretation: there is
a symmetry in the interaction between particles (action-reaction law).

3. Characterization of the element of minimal norm in the subdifferential

In this section and in the next we analyze the Wasserstein subdifferential of $\mathcal{W}$ and we
prove the core results of this work.

Theorem 3.1. Consider a Borel measurable selection $(\eta^1, \eta^2) \in \partial W$, i.e., $(\eta^1, \eta^2) : \mathbb{R}^d \times
\mathbb{R}^d \to \mathbb{R}^d \times \mathbb{R}^d$ is a Borel measurable function such that $(\eta^1(x, y), \eta^2(x, y)) \in \partial W(x, y)$ for
every $(x, y) \in \mathbb{R}^d \times \mathbb{R}^d$. For any $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, the map

(3.1) $\xi(x) := \frac{1}{2} \int_{\mathbb{R}^d} \left(\eta^1(x, y) + \eta^2(y, x)\right) d\mu(y)$

belongs to $\partial_S \mathcal{W}(\mu)$. In particular $\partial_S \mathcal{W}(\mu)$ is not empty.

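Proof. Since $W$ is $\lambda$-convex, the subdifferential inequality (1.10) in this case can be written
as follows

(3.2) $W(y_1, y_2) - W(x_1, x_2) \ge \langle (\eta^1(x_1, x_2), \eta^2(x_1, x_2)), (y_1 - x_1, y_2 - x_2) \rangle + \frac{\lambda}{2} |(y_1 - x_1, y_2 - x_2)|^2$

for every $(x_1, x_2, y_1, y_2) \in (\mathbb{R}^d)^4$. Let $\mu, \nu \in \mathcal{P}_2(\mathbb{R}^d)$ and $\gamma \in \Gamma(\mu, \nu)$. We show that inequality
(2.3) holds. Considering the measure $\gamma \times \gamma$, we can write

$\mathcal{W}(\nu) - \mathcal{W}(\mu) = \frac{1}{2} \int_{(\mathbb{R}^d)^4} \left(W(y_1, y_2) - W(x_1, x_2)\right) d(\gamma \times \gamma)(x_1, y_1, x_2, y_2).$

Hence, by (3.2),

$\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \frac{1}{2} \int_{(\mathbb{R}^d)^4} \left[\langle \eta^1(x_1, x_2), y_1 - x_1 \rangle + \langle \eta^2(x_1, x_2), y_2 - x_2 \rangle\right] d(\gamma \times \gamma)(x_1, y_1, x_2, y_2)$
$\qquad + \frac{\lambda}{4} \int_{(\mathbb{R}^d)^4} \left(|y_1 - x_1|^2 + |y_2 - x_2|^2\right) d(\gamma \times \gamma)(x_1, y_1, x_2, y_2).$

The last term is $\frac{\lambda}{2} C^2(\mu, \nu; \gamma)$, so that a change of variables gives

$\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \frac{1}{2} \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \eta^1(x_1, x_2), y_1 - x_1 \rangle\, d\gamma(x_1, y_1)\, d\mu(x_2)$
$\qquad + \frac{1}{2} \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \eta^2(x_1, x_2), y_2 - x_2 \rangle\, d\gamma(x_2, y_2)\, d\mu(x_1) + \frac{\lambda}{2} C^2(\mu, \nu; \gamma)$
$\quad = \frac{1}{2} \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \eta^1(x, z), y - x \rangle\, d\gamma(x, y)\, d\mu(z)$
$\qquad + \frac{1}{2} \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \eta^2(z, x), y - x \rangle\, d\gamma(x, y)\, d\mu(z) + \frac{\lambda}{2} C^2(\mu, \nu; \gamma)$
$\quad = \int_{\mathbb{R}^d \times \mathbb{R}^d} \left\langle \frac{1}{2} \int_{\mathbb{R}^d} \left(\eta^1(x, z) + \eta^2(z, x)\right) d\mu(z),\ y - x \right\rangle d\gamma(x, y) + \frac{\lambda}{2} C^2(\mu, \nu; \gamma),$

as desired. □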

In the case of a smooth interaction function $W$, there is a complete characterization of the
strong subdifferential $\partial_S \mathcal{W}(\mu)$, which is single valued.

Proposition 3.2 (The smooth case). Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. If $W \in C^1(\mathbb{R}^d \times \mathbb{R}^d)$ satisfies the
assumptions (1.2), (1.3), and (1.4), then the strong Wasserstein subdifferential is a singleton
and it is of the form

(3.3) $\partial_S \mathcal{W}(\mu) = \left\{ \int_{\mathbb{R}^d} \nabla_1 W(\cdot, y)\, d\mu(y) \right\} = \left\{ \int_{\mathbb{R}^d} \nabla_2 W(y, \cdot)\, d\mu(y) \right\} = \left\{ \frac{1}{2} \int_{\mathbb{R}^d} \left(\nabla_1 W(\cdot, y) + \nabla_2 W(y, \cdot)\right) d\mu(y) \right\},$

where $\nabla_1$ (resp. $\nabla_2$) denotes the gradient with respect to the first $d$ (resp. last $d$) variables of
$\mathbb{R}^d \times \mathbb{R}^d$.

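Proof. Since $\partial W(x, y) = \{\nabla W(x, y)\}$, by Theorem 3.1 and the symmetry of $W$ we have that
the right hand sides of (3.3) are contained in $\partial_S \mathcal{W}(\mu)$.

In order to prove the opposite inclusion, assume that $\xi \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ belongs to $\partial_S \mathcal{W}(\mu)$.
Let $s \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ be an arbitrary vector field, $\nu = s_\# \mu$ and $\mu_t = (i + ts)_\# \mu$. Writing (2.3)
in correspondence of the plan

$\gamma_t = (i, i + ts)_\# \mu$

between $\mu$ and $\mu_t$, we have

$\mathcal{W}(\mu_t) - \mathcal{W}(\mu) \ge \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \xi(x), y - x \rangle\, d\gamma_t(x, y) + \frac{\lambda}{2} C^2(\mu, \mu_t; \gamma_t).$

Hence, for every $t > 0$

$\frac{1}{2t} \int_{\mathbb{R}^d \times \mathbb{R}^d} \left(W(x + ts(x), y + ts(y)) - W(x, y)\right) d(\mu \times \mu)(x, y) \ge \int_{\mathbb{R}^d} \langle \xi(x), s(x) \rangle\, d\mu(x) + \frac{\lambda}{2} t\, \|s\|^2_{L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)},$

and, by a direct computation

(3.4) $\frac{1}{2t} \int_{\mathbb{R}^d \times \mathbb{R}^d} \left(W(x + ts(x), y + ts(y)) - W(x, y)\right) d(\mu \times \mu)(x, y)$
$\qquad \ge \frac{\lambda}{4t} \int_{\mathbb{R}^d \times \mathbb{R}^d} \left(|(x + ts(x), y + ts(y))|^2 - |(x, y)|^2\right) d(\mu \times \mu)(x, y)$
$\qquad\quad + \int_{\mathbb{R}^d} \langle \xi(x), s(x) \rangle\, d\mu(x) - \lambda \int_{\mathbb{R}^d} \langle x, s(x) \rangle\, d\mu(x).$

Since $W$ is $\lambda$-convex, the map

$t \mapsto \frac{1}{t} \left(W(x + ts(x), y + ts(y)) - W(x, y)\right) - \frac{\lambda}{2t} \left(|(x + ts(x), y + ts(y))|^2 - |(x, y)|^2\right)$

is nondecreasing in $t$, for $t > 0$. Taking advantage of the $C^1$ regularity and the quadratic
growth of $W$, by the monotone convergence theorem, we can pass to the limit in (3.4) as $t$
goes to 0, obtaining

(3.5) $\frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla W(x, y), (s(x), s(y)) \rangle\, d(\mu \times \mu)(x, y) \ge \int_{\mathbb{R}^d} \langle \xi, s \rangle\, d\mu.$

Since by the symmetry of $W$ we have $\nabla_1 W(x, y) = \nabla_2 W(y, x)$ for any $x, y \in \mathbb{R}^d$, we can
write

$\frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla W(x, y), (s(x), s(y)) \rangle\, d(\mu \times \mu)(x, y)$
$\quad = \frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla_1 W(x, y), s(x) \rangle\, d(\mu \times \mu)(x, y) + \frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla_1 W(y, x), s(y) \rangle\, d(\mu \times \mu)(x, y)$
$\quad = \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla_1 W(x, y), s(x) \rangle\, d(\mu \times \mu)(x, y).$

This way, (3.5) becomes

$\int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \nabla_1 W(x, y)\, d\mu(y),\ s(x) \right\rangle d\mu(x) \ge \int_{\mathbb{R}^d} \langle \xi, s \rangle\, d\mu,$

that is

$\int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \nabla_1 W(x, y)\, d\mu(y) - \xi(x),\ s(x) \right\rangle d\mu(x) \ge 0.$

Since $s \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ is arbitrary we conclude that $\xi(x) = \int_{\mathbb{R}^d} \nabla_1 W(x, y)\, d\mu(y)$ as elements
of $L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$. □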



When we drop the assumption $W \in C^1(\mathbb{R}^d \times \mathbb{R}^d)$, the subdifferential of $\mathcal{W}$ is in general
multivalued. In the next Theorem, we characterize the element of minimal norm in the
subdifferential of $\mathcal{W}$ at the point $\mu$, which is of the form (3.1).

Remark 3.3. The property stating that every element of the subdifferential of $\mathcal{W}$ at the
point $\mu$ is of the form (3.1) for a suitable Borel selection of the subdifferential of $W$ could be
in general very difficult to prove, and we do not know if it is true.

Theorem 3.4. Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$ and $\xi = \partial^\circ \mathcal{W}(\mu)$. Then there exists a measurable selection
$(\eta^1, \eta^2) \in \partial W$ such that

$\xi(x) = \frac{1}{2} \int_{\mathbb{R}^d} \left(\eta^1(x, y) + \eta^2(y, x)\right) d\mu(y).$

The proof of Theorem 3.4 needs several preliminary results. We will make use of a sequence
of regularized functionals $\mathcal{W}_n$. We recall that the Moreau-Yosida approximation of
the function $W$ is defined as

(3.6) $W_n(x, y) := \inf_{(v, w) \in \mathbb{R}^d \times \mathbb{R}^d} \left\{ W(v, w) + \frac{n}{2} |(x - v, y - w)|^2 \right\}.$

We have $W_n(x, y) \le W(x, y)$ for every $(x, y) \in \mathbb{R}^d \times \mathbb{R}^d$, $W_n \in C^{1,1}(\mathbb{R}^d \times \mathbb{R}^d)$ and the
sequence $\{W_n\}_{n \in \mathbb{N}}$ converges pointwise and monotonically to $W$ as $n \to \infty$.
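The three properties just listed can be checked by hand on a one-variable toy case. The sketch below is an illustration under stated assumptions, not part of the paper's argument: it replaces the two-variable potential by the hypothetical function $f(x) = |x|$ on the real line, for which the infimum in the analogue of (3.6) is attained at the soft-thresholded point and the regularization is the well-known Huber function. The function names are illustrative.

```python
import numpy as np

def prox_abs(x, n):
    """Minimizer v of |v| + (n/2)|x - v|^2: soft-thresholding with threshold 1/n."""
    x = np.asarray(x, dtype=float)
    return np.sign(x) * np.maximum(np.abs(x) - 1.0 / n, 0.0)

def moreau_yosida_abs(x, n):
    """Moreau-Yosida approximation of f(x) = |x| with parameter n,
    evaluated by plugging the minimizer into the objective."""
    x = np.asarray(x, dtype=float)
    v = prox_abs(x, n)
    return np.abs(v) + 0.5 * n * (x - v) ** 2

xs = np.linspace(-2.0, 2.0, 401)
f1, f10, f100 = (moreau_yosida_abs(xs, n) for n in (1, 10, 100))
```

On this example one can verify that the approximations increase with $n$, stay below $|x|$, and differ from $|x|$ by at most $1/(2n)$, which is the one-dimensional counterpart of the monotone pointwise convergence stated above.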

If the $\lambda$-convexity property of $W$ is satisfied, there exists a constant $K > 0$ such that

(3.7) $W(x, y) \ge -K(1 + |x|^2 + |y|^2),$

and in this case we can show the corresponding bound for $W_n$, uniformly in $n$.

Proposition 3.5. If $W$ enjoys the $\lambda$-convexity assumption (1.3), there exist $\bar{n} \in \mathbb{N}$ and a
constant $\bar{K} > 0$ such that for all $n \ge \bar{n}$

$W_n(x, y) \ge -\bar{K}(1 + |x|^2 + |y|^2).$

Proof. Indeed, by (3.6) and the estimate (3.7) there holds

(3.8) $W_n(x, y) \ge \inf_{(v, w) \in \mathbb{R}^d \times \mathbb{R}^d} \left\{ -K(1 + |v|^2 + |w|^2) + \frac{n}{2} |(x - v, y - w)|^2 \right\}.$

Here we compute the minimum. The gradient of the right hand side is

$(-2Kv - n(x - v),\ -2Kw - n(y - w)),$

and it vanishes when $(v, w) = \frac{n}{n - 2K}(x, y)$. Substituting in (3.8) we get

$W_n(x, y) \ge -K\left(1 + \frac{n^2}{(n - 2K)^2}(|x|^2 + |y|^2)\right) + \frac{2K^2 n}{(n - 2K)^2}(|x|^2 + |y|^2) = -K - \frac{Kn}{n - 2K}(|x|^2 + |y|^2).$

If we choose for instance $\bar{K} = 2K$, it is then clear that there exists a large enough $\bar{n}$ (depending
only on $K$) such that the desired inequality holds for any $n \ge \bar{n}$. □

We define the approximating interaction functionals

(3.9) $\mathcal{W}_n(\mu) := \frac{1}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d} W_n(x, y)\, d(\mu \times \mu)(x, y).$

Remark 3.6 (Semicontinuity properties of $\mathcal{W}$). Since $W$ might enjoy a negative quadratic
behavior at infinity, it is not true that $\mathcal{W}$ is lower semicontinuous also with respect
to the narrow convergence. However, it is shown in [CDFLS, §2] that one can choose $\tau_0$
small enough (depending only on $W$) such that for any $\tau < \tau_0$ and for any $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, the
functional

(3.10) $\nu \mapsto \mathcal{W}(\nu) + \frac{1}{2\tau} d_W^2(\nu, \mu)$

is lower semicontinuous with respect to the narrow convergence. Moreover, for $\tau < \tau_0$, minimizers
do exist for (3.10). The arguments of [CDFLS] are given for a function $W$ such that
$W(x, x) = 0$ for any $x$ and assumption iv) holds. They can be adapted in a straightforward
way if these hypotheses are omitted. Moreover, in the case of the approximating functionals
$\mathcal{W}_n$ defined in (3.9), we can choose $\tau_0$ independently of $n$. Indeed, since Proposition 3.5 gives
the bound $W_n(x, y) \ge -\bar{K}(1 + |x|^2 + |y|^2)$, for some $\bar{K} > 0$, it is enough to choose $\tau_0$ small
enough such that

$\nu \mapsto \int_{\mathbb{R}^d \times \mathbb{R}^d} -\bar{K}(1 + |x|^2 + |y|^2)\, d(\nu \times \nu)(x, y) + \frac{1}{2\tau} d_W^2(\mu, \nu)$

is narrowly lower semicontinuous.

We prove the following more general lower semicontinuity property. 

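Proposition 3.7. Let $\tau_0$ be as in Remark 3.6. Let $\tau < \tau_0$ and let $\nu \in \mathscr{P}_2(\mathbb{R}^d)$. For any $\mu \in \mathscr{P}_2(\mathbb{R}^d)$ and for any sequence $\{\mu_n\}_{n \in \mathbb{N}}$ such that $\mu_n$ narrowly converges to $\mu$ and $\sup_n \int_{\mathbb{R}^d} |x|^2\, d\mu_n(x) < +\infty$, there holds

$\mathcal{W}(\mu) + \frac{1}{2\tau}\, d_W^2(\mu, \nu) \le \liminf_{n \to \infty} \left( \mathcal{W}_n(\mu_n) + \frac{1}{2\tau}\, d_W^2(\mu_n, \nu) \right).$

Moreover, for $\mu \in \mathscr{P}_2(\mathbb{R}^d)$ there holds $\mathcal{W}_n(\mu) \to \mathcal{W}(\mu)$.

Proof. Since $W_n \ge W_k$ if $n \ge k$, by Remark 3.6 we have

$\liminf_{n \to \infty} \left( \mathcal{W}_n(\mu_n) + \frac{1}{2\tau}\, d_W^2(\mu_n, \nu) \right) \ge \liminf_{n \to \infty} \left( \mathcal{W}_k(\mu_n) + \frac{1}{2\tau}\, d_W^2(\mu_n, \nu) \right) \ge \mathcal{W}_k(\mu) + \frac{1}{2\tau}\, d_W^2(\mu, \nu)$

for any fixed $k \in \mathbb{N}$. Now we shall pass to the limit as $k \to \infty$. Notice that $W_k \nearrow W$ pointwise and monotonically, and thus by the monotone convergence theorem, $\mathcal{W}_k(\mu)$ converges to $\mathcal{W}(\mu)$. Both statements are proven. □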



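We recall a suitable notion of convergence of a sequence of vector fields $\xi_n \in L^2(\mathbb{R}^d, \mu_n; \mathbb{R}^d)$.

Definition 3.8. Let $\mu_n$ be narrowly convergent to $\mu$ and let $\xi_n \in L^2(\mathbb{R}^d, \mu_n; \mathbb{R}^d)$. We say that $\xi_n$ weakly converges to $\xi \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ if

(3.11)  $\int_{\mathbb{R}^d} \langle \xi_n, \zeta \rangle\, d\mu_n \to \int_{\mathbb{R}^d} \langle \xi, \zeta \rangle\, d\mu, \quad \forall \zeta \in C_c^\infty(\mathbb{R}^d; \mathbb{R}^d).$

We say that the convergence is strong if (3.11) holds and

$\int_{\mathbb{R}^d} |\xi_n|^2\, d\mu_n \to \int_{\mathbb{R}^d} |\xi|^2\, d\mu.$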

Remark 3.9. Consider the set $\mathcal{M}_d$ of vector measures of the form $\xi\mu$, with $\mu \in \mathscr{P}_2(\mathbb{R}^d)$ and $\xi \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$. Reasoning as in [AGS, §5.1], we know that the weak convergence (3.11) is metrizable on every subset $A$ of $\mathcal{M}_d$ such that

$\sup_A \int_{\mathbb{R}^d} |\xi|\, d\mu < +\infty.$

Moreover, by [AGS, Theorem 5.4.4], if $A \subset \mathcal{M}_d$ is such that

$\sup_A \int_{\mathbb{R}^d} |\xi|^2\, d\mu < +\infty,$

then $A$ is also compact with respect to the weak convergence (3.11).

We also need to define the barycentric projection. 

Definition 3.10 (Disintegration and barycenter). Given $\beta \in \Gamma(\mu, \nu)$, we denote by $\beta_x$ the Borel family of measures in $\mathscr{P}(\mathbb{R}^d)$ such that $\beta = \int_{\mathbb{R}^d} \beta_x\, d\mu(x)$, which disintegrates $\beta$ with respect to $\mu$. The notation above means that the integral of a Borel function $\varphi : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$ such that $\varphi \in L^1(\beta)$ can be sliced as

$\int_{\mathbb{R}^d \times \mathbb{R}^d} \varphi(x,y)\, d\beta(x,y) = \int_{\mathbb{R}^d} \int_{\mathbb{R}^d} \varphi(x,y)\, d\beta_x(y)\, d\mu(x).$

The barycentric projection of $\beta \in \Gamma(\mu, \nu)$ is defined by

$\bar\beta(x) := \int_{\mathbb{R}^d} y\, d\beta_x(y).$

For more detail about disintegration see [AFP, Theorem 2.28].
We can prove the following simple 
Proposition 3.11. Let $\{\mu_n\} \subset \mathscr{P}_2(\mathbb{R}^d)$, $\{\nu_n\} \subset \mathscr{P}_2(\mathbb{R}^d)$ be sequences with uniformly bounded second moments and narrowly convergent to $\mu$ and $\nu$ respectively. For every choice $\gamma_n \in \Gamma_o(\mu_n, \nu_n)$, we have that the sequence $\{\gamma_n\}$ is tight and every limit point with respect to the narrow convergence in $\mathscr{P}(\mathbb{R}^d \times \mathbb{R}^d)$ belongs to $\Gamma_o(\mu, \nu)$. Moreover, if $\gamma$ is a limit point and $\gamma_{n_k}$ is a subsequence narrowly convergent to $\gamma$, then

$\bar\gamma_{n_k} \to \bar\gamma$ weakly in the sense of Definition 3.8 as $k \to +\infty$.

Proof. The tightness and optimality are contained in [AGS, Proposition 7.1.3]. Let $(\gamma_n)_x$ be the disintegration of $\gamma_n$ with respect to $\mu_n$ and $\gamma_x$ be the disintegration of $\gamma$ with respect to $\mu$. Let $\zeta \in C_c^\infty(\mathbb{R}^d; \mathbb{R}^d)$ and $f : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$ the function defined by $f(x,y) = \langle \zeta(x), y \rangle$. Since $f$ is continuous and satisfies $|f(x,y)| \le C|y|$ for every $(x,y) \in \mathbb{R}^d \times \mathbb{R}^d$ and $\sup_n \int_{\mathbb{R}^d \times \mathbb{R}^d} (|x|^2 + |y|^2)\, d\gamma_n(x,y) < +\infty$, by [AGS, Lemma 5.1.7] we have that $\int f\, d\gamma_{n_k} \to \int f\, d\gamma$ as $k \to +\infty$. Using this property and the definition of barycenter, we have

$\int_{\mathbb{R}^d} \langle \zeta, \bar\gamma_{n_k} \rangle\, d\mu_{n_k} = \int_{\mathbb{R}^d \times \mathbb{R}^d} f\, d\gamma_{n_k} \to \int_{\mathbb{R}^d \times \mathbb{R}^d} f\, d\gamma = \int_{\mathbb{R}^d} \langle \zeta, \bar\gamma \rangle\, d\mu.$ □

We recall a definition from [AGS, Chapter 10].
Definition 3.12 (Rescaled plan). Let $\tau < \tau_0$ (as in Remark 3.6). Given $\mu \in \mathscr{P}_2(\mathbb{R}^d)$, let



Next we introduce an abstract result about approximation of the minimal selection in the subdifferential of $\mathcal{W}$. The argument is indeed a direct consequence of the analysis in [AGS, §10.3], but requires the concept of plan subdifferential. Since this is a technical definition, we prefer to postpone its discussion to the end of the paper. Therefore, the proof of the following proposition is given in the appendix.

Proposition 3.13. Let $\mu$, $\mu_\tau$ and $\gamma_\tau$ be as in Definition 3.12. Then $\mu_\tau \to \mu$ in $\mathscr{P}_2(\mathbb{R}^d)$ as $\tau \to 0$. Moreover, denoting by $\bar\gamma_\tau$ the barycenter of $\gamma_\tau$, we have that $\bar\gamma_\tau \in \partial_S \mathcal{W}(\mu_\tau)$ and

$\bar\gamma_\tau \to \partial^0 \mathcal{W}(\mu)$ strongly in the sense of Definition 3.8 as $\tau \to 0$.

Making use of Proposition 3.13 we can prove the following

Lemma 3.14. Let $\mu \in \mathscr{P}_2(\mathbb{R}^d)$. There exists a sequence $\{\mu_n\}_{n \in \mathbb{N}} \subset \mathscr{P}_2(\mathbb{R}^d)$, with $\sup_n \int_{\mathbb{R}^d} |x|^2\, d\mu_n < +\infty$, such that $\mu_n$ narrowly converges to $\mu$ and

$\int_{\mathbb{R}^d} \nabla_1 W_n(\cdot, y)\, d\mu_n(y) \to \partial^0 \mathcal{W}(\mu)$ weakly in the sense of Definition 3.8.
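Proof. Let $\tau_0$ be as in Remark 3.6 and consider a measure $\mu \in \mathscr{P}_2(\mathbb{R}^d)$. Define, for $\tau < \bar\tau < \tau_0$,

$\mu_\tau^h \in \operatorname{argmin} \left\{ \mathcal{W}_h(\nu) + \frac{1}{2\tau}\, d_W^2(\nu, \mu) : \nu \in \mathscr{P}_2(\mathbb{R}^d) \right\}.$

By the uniform estimates of [AGS, Lemma 2.2.1],

(3.12)  $\sup_{h \in \mathbb{N},\, \tau < \bar\tau} \int_{\mathbb{R}^d} |x|^2\, d\mu_\tau^h < +\infty.$

This shows that the sequence $\{\mu_\tau^h\}_{h \in \mathbb{N}}$ is tight and has bounded second moments. Let $\mu_\tau$ be a narrow limit point. Proposition 3.7 yields, for any $\nu \in \mathscr{P}_2(\mathbb{R}^d)$,

$\mathcal{W}(\mu_\tau) + \frac{1}{2\tau}\, d_W^2(\mu_\tau, \mu) \le \liminf_{h \to \infty} \left( \mathcal{W}_h(\mu_\tau^h) + \frac{1}{2\tau}\, d_W^2(\mu_\tau^h, \mu) \right) \le \liminf_{h \to \infty} \left( \mathcal{W}_h(\nu) + \frac{1}{2\tau}\, d_W^2(\nu, \mu) \right) = \mathcal{W}(\nu) + \frac{1}{2\tau}\, d_W^2(\nu, \mu).$

This shows that $\mu_\tau$ is indeed a minimizer for (3.10). Let $\hat\gamma_\tau^h \in \Gamma_o(\mu_\tau^h, \mu)$ and let $\gamma_\tau^h$ be the corresponding rescaled plans (see Definition 3.12). If $\{h(m)\}_{m \in \mathbb{N}} \subset \mathbb{N}$ is a sequence such that $\mu_\tau^{h(m)}$ narrowly converges to $\mu_\tau$, since $\{\mu_\tau^{h(m)}\}_{m \in \mathbb{N}}$ has uniformly bounded second moments due to (3.12), by Proposition 3.11 we have (possibly on a further subsequence that we do not relabel) the narrow limits $\hat\gamma_\tau^{h(m)} \to \hat\gamma_\tau$ and $\gamma_\tau^{h(m)} \to \gamma_\tau$ in $\mathscr{P}(\mathbb{R}^d \times \mathbb{R}^d)$, and also $\bar\gamma_\tau^{h(m)} \to \bar\gamma_\tau$ weakly in the sense of Definition 3.8. Let $d$ be a distance which metrizes this weak convergence. Indeed, it is metrizable thanks to Remark 3.9, since

$\int_{\mathbb{R}^d} |\bar\gamma_\tau^{h(m)}(x)|^2\, d\mu_\tau^{h(m)}(x) \le \int_{\mathbb{R}^d} \int_{\mathbb{R}^d} |y|^2\, d(\gamma_\tau^{h(m)})_x(y)\, d\mu_\tau^{h(m)}(x) = \int_{\mathbb{R}^d \times \mathbb{R}^d} |y|^2\, d\gamma_\tau^{h(m)}(x,y) = \frac{1}{\tau^2}\, d_W^2(\mu_\tau^{h(m)}, \mu),$

which is uniformly bounded in $m$ for any fixed $0 < \tau < \bar\tau$ due to (3.12).

Then, if $\tau(n)$ is a vanishing sequence, we can extract a further subsequence $\{h(n)\}_{n \in \mathbb{N}}$ from $\{h(m)\}_{m \in \mathbb{N}}$ such that

$d\left(\bar\gamma_{\tau(n)}^{h(n)}, \bar\gamma_{\tau(n)}\right) < \frac{1}{n}.$

Then we have

$d\left(\bar\gamma_{\tau(n)}^{h(n)}, \partial^0 \mathcal{W}(\mu)\right) \le d\left(\bar\gamma_{\tau(n)}^{h(n)}, \bar\gamma_{\tau(n)}\right) + d\left(\bar\gamma_{\tau(n)}, \partial^0 \mathcal{W}(\mu)\right) < \frac{1}{n} + d\left(\bar\gamma_{\tau(n)}, \partial^0 \mathcal{W}(\mu)\right).$

Invoking Proposition 3.13 we know that $\bar\gamma_\tau$ converges to $\partial^0 \mathcal{W}(\mu)$ weakly in the sense of Definition 3.8 as $\tau \to 0$. Hence, passing to the limit as $n \to \infty$, we see that $\bar\gamma_n := \bar\gamma_{\tau(n)}^{h(n)}$ weakly converge to $\partial^0 \mathcal{W}(\mu)$ in the sense of Definition 3.8.

Finally, by Proposition 3.13 we know that, for any $n$, $\bar\gamma_n \in \partial_S \mathcal{W}_n(\mu_n)$. Since $W_n$ is $C^1$, by the characterization of the strong subdifferential of Proposition 3.2 we have

$\bar\gamma_n(x) = \int_{\mathbb{R}^d} \nabla_1 W_n(x,y)\, d\mu_n(y)$

and the proof is concluded. □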

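Proof of Theorem 3.4. Let $\mu \in \mathscr{P}_2(\mathbb{R}^d)$, $\xi = \partial^0 \mathcal{W}(\mu)$ and $\mu_n$ the sequence given by Lemma 3.14. By Proposition 3.2, the only element of $\partial_S \mathcal{W}_n(\mu_n)$ is given by

$\xi_n(x) := \int_{\mathbb{R}^d} \nabla_1 W_n(x,y)\, d\mu_n(y).$

Let us consider the maps $(i, \nabla W_n) : \mathbb{R}^d \times \mathbb{R}^d \to (\mathbb{R}^d)^4$ given by

$(i, \nabla W_n)(x,y) = (x, y, \nabla W_n(x,y)).$

Introducing the measures

$\nu_n := (i, \nabla W_n)_\# (\mu_n \times \mu_n),$

by Lemma 3.14 we have, for any $\zeta \in C_c^\infty(\mathbb{R}^d; \mathbb{R}^d)$,

(3.13)  $\int_{(\mathbb{R}^d)^4} \langle v, \zeta(x) \rangle\, d\nu_n(x,y,v,w) = \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \nabla_1 W_n(x,y), \zeta(x) \rangle\, d(\mu_n \times \mu_n)(x,y) = \int_{\mathbb{R}^d} \langle \xi_n, \zeta \rangle\, d\mu_n \to \int_{\mathbb{R}^d} \langle \xi, \zeta \rangle\, d\mu.$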

The sequence v n is tight. Indeed, from the quadratic growth of W and W n at infinity and 
the boundedness of fi n in ^2 (K ) we obtain the uniform estimate 

sup / \x\ 2 d(y{W n #n n ){x) = sup / |ViW n | 2 d^ n (x) < +00, 

n£N JR d n jR d 



which implies the tightness of the marginals of v n . Then we can extract a subsequence (that 
we do not relabel) narrowly converging to some v G &>((R d ) 4 ). Moreover, for ( G Cfi°(R d ), 
due to the linear growth of the integrand we have 

lim / (v,((x))du n (x,y,v,w) = (v, ((x)) du(x, y, v, w). 

n ^°°y(iR d ) 4 J(R d ) 4 

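The narrow convergence of measures implies that $\operatorname{supp}(\nu)$ is contained in the Kuratowski minimum limit of the supports of $\nu_n$ (see for instance [AGS, Proposition 5.1.8]), i.e. for every $(x, y, \eta) \in \operatorname{supp}(\nu)$ there exists a sequence $(x_n, y_n, \eta_n) \in \operatorname{supp}(\nu_n)$ such that $(x_n, y_n, \eta_n)$ converges to $(x, y, \eta)$. Since, by definition of $\nu_n$, $\operatorname{supp}(\nu_n) \subset \operatorname{graph}(\partial W_n)$, then $\operatorname{supp}(\nu) \subset \operatorname{graph}(\partial W)$. Indeed $\eta_n \in \partial W_n(x_n, y_n)$ and passing to the limit in the subdifferential inequality we obtain that $\eta \in \partial W(x,y)$.

Disintegrating $\nu$ with respect to $(x,y)$, we obtain the measurable family of measures $(x,y) \mapsto \nu_{x,y}$ such that

$\nu = \int_{\mathbb{R}^d \times \mathbb{R}^d} \nu_{x,y}\, d(\mu \times \mu)(x,y).$

It follows that $\operatorname{supp}(\nu_{x,y}) \subset \partial W(x,y)$. In the limit

$\lim_{n \to \infty} \int_{\mathbb{R}^d} \langle \xi_n(x), \zeta(x) \rangle\, d\mu_n(x) = \frac{1}{2} \int_{(\mathbb{R}^d)^4} \langle v, \zeta(x) \rangle\, d\nu(x,y,v,w) + \frac{1}{2} \int_{(\mathbb{R}^d)^4} \langle w, \zeta(y) \rangle\, d\nu(x,y,v,w)$

$= \frac{1}{2} \int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} v\, d\nu_{x,y}(v,w)\, d\mu(y), \zeta(x) \right\rangle d\mu(x) + \frac{1}{2} \int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} w\, d\nu_{x,y}(v,w)\, d\mu(x), \zeta(y) \right\rangle d\mu(y)$

$= \int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \int_{\mathbb{R}^d \times \mathbb{R}^d} \frac{1}{2} \left( v\, d\nu_{x,y} + w\, d\nu_{y,x} \right)(v,w)\, d\mu(y), \zeta(x) \right\rangle d\mu(x).$

Here we made use of the symmetry of $W$, which entails

$\int \phi(v) \psi(x)\, d\nu(x,y,v,w) = \int \phi(w) \psi(y)\, d\nu(x,y,v,w)$

for any functions $\phi$, $\psi$, by using this latter equality for $\nu_n$ (since $W_n$ is symmetric) and passing to the limit. Defining

$\eta^1(x,y) := \int_{\mathbb{R}^d \times \mathbb{R}^d} v\, d\nu_{x,y}(v,w), \quad \text{and} \quad \eta^2(x,y) := \int_{\mathbb{R}^d \times \mathbb{R}^d} w\, d\nu_{x,y}(v,w),$

we obtain

$\lim_{n \to \infty} \int_{\mathbb{R}^d} \langle \xi_n(x), \zeta(x) \rangle\, d\mu_n(x) = \int_{\mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \frac{1}{2} \left( \eta^1(x,y) + \eta^2(y,x) \right) d\mu(y), \zeta(x) \right\rangle d\mu(x).$

By (3.13), we get

$\xi(x) = \int_{\mathbb{R}^d} \frac{1}{2} \left( \eta^1(x,y) + \eta^2(y,x) \right) d\mu(y).$

On the other hand, we proved that $\operatorname{supp}(\nu_{x,y}) \subset \partial W(x,y)$. Since $\eta^1(x,y), \eta^2(x,y)$ are barycenters and $\partial W(x,y)$ is convex, we have that $(\eta^1(x,y), \eta^2(x,y)) \in \partial W(x,y)$. □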

4. The convolution case 

4.1. The case of W depending on the difference. In the case of assumption (1.8) we can particularize the results above.

Lemma 4.1. Let (1.8) hold. Then $(\eta^1, \eta^2) \in \partial W(x,y)$ if and only if there exists $\eta \in \partial W(x - y)$ such that

$(\eta^1, \eta^2) = (\eta, -\eta).$

Proof. Assume that $W$ is convex; the general case follows by considering $x \mapsto W(x) - \frac{\lambda}{2}|x|^2$. If $\eta \in \partial W$ we have for every $(\bar x, \bar y) \in \mathbb{R}^d \times \mathbb{R}^d$

$W(\bar x, \bar y) - W(x, y) = W(\bar x - \bar y) - W(x - y) \ge \langle \eta(x - y), \bar x - \bar y - (x - y) \rangle = \langle (\eta(x - y), -\eta(x - y)), (\bar x, \bar y) - (x, y) \rangle,$

which means that $(\eta(x - y), -\eta(x - y)) \in \partial W(x, y)$, with the abuse of notation $\eta(x, y) = \eta(x - y)$.

On the other hand, if $(\eta^1, \eta^2) \in \partial W$, then for every $(\bar x, \bar y) \in \mathbb{R}^d \times \mathbb{R}^d$ we have

$W(\bar x - \bar y) - W(x - y) = W(\bar x, \bar y) - W(x, y) \ge \langle (\eta^1(x,y), \eta^2(x,y)), (\bar x, \bar y) - (x, y) \rangle$

(4.1)  $= \langle \eta^1(x,y), \bar x - x \rangle + \langle \eta^2(x,y), \bar y - y \rangle.$

Assuming in particular that $\bar x - \bar y = x - y$, so that $\bar x - x = \bar y - y$, the inequality above reduces to

$0 \ge \langle \eta^1(x,y) + \eta^2(x,y), \bar y - y \rangle,$

and the arbitrariness of $\bar y$ implies that $\eta^1 = -\eta^2$. Using this relation in (4.1) we obtain

$W(\bar x - \bar y) - W(x - y) \ge \langle \eta^1(x,y), (\bar x - \bar y) - (x - y) \rangle,$

which means that $\eta^1 \in \partial W$. □

Applying the main results of the general case we obtain the following 

Corollary 4.2. Let assumption (1.8) hold. If $\eta$ is a Borel measurable anti-symmetric selection of $\partial W$, then for any $\mu \in \mathscr{P}_2(\mathbb{R}^d)$,

(4.2)  $\int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y) = \eta * \mu \in \partial_S \mathcal{W}(\mu).$

Conversely, if $\xi = \partial^0 \mathcal{W}(\mu)$, then there exists a Borel measurable anti-symmetric selection $\eta \in \partial W$ such that

$\xi(x) = \int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y) = \eta * \mu.$

Proof. Let $\eta$ be an antisymmetric selection in $\partial W$. In particular $\eta(x - y) = -\eta(y - x)$ for every $x, y \in \mathbb{R}^d$ and by Lemma 4.1 $(\eta^1(x,y), \eta^2(x,y)) := (\eta(x - y), \eta(y - x)) \in \partial W(x,y)$. Applying Theorem 3.1 to the Borel measurable selection $(\eta^1, \eta^2)$ just defined, we get (4.2). Conversely, assuming that $\xi = \partial^0 \mathcal{W}(\mu)$, by Theorem 3.4 and Lemma 4.1 we obtain that

$\xi(x) = \int_{\mathbb{R}^d} \frac{1}{2} \left( \eta^1(x - y) - \eta^1(y - x) \right) d\mu(y).$

By choosing $\eta(z) = \frac{1}{2} \left( \eta^1(z) - \eta^1(-z) \right)$, we conclude. □

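Remark 4.3. We observe that if $\eta$ is a Borel measurable selection of $\partial W$ such that for any $\mu \in \mathscr{P}_2(\mathbb{R}^d)$

$\xi(x) := \int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y) = \eta * \mu \in \partial_S \mathcal{W}(\mu),$

then $\eta$ is antisymmetric. Indeed, let $\eta \in \partial W$ be such that

$\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \int_{\mathbb{R}^d \times \mathbb{R}^d} \left\langle \int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y), z - x \right\rangle d\gamma(x, z),$

for $\gamma \in \Gamma(\mu, \nu)$. Choosing $\mu = \delta_{x_1}$, $\nu = \delta_{x_2}$ and $\gamma = \mu \times \nu$, the inequality becomes

$0 \ge \langle \eta(0), x_2 - x_1 \rangle,$

and since this must hold for any $x_1, x_2 \in \mathbb{R}^d$, we deduce $\eta(0) = 0$. Moreover, taking into account that $\eta(0) = 0$, if $\mu = \frac{1}{2}\delta_{x_1} + \frac{1}{2}\delta_{x_2}$, $\nu = \frac{1}{2}\delta_{x_3} + \frac{1}{2}\delta_{x_4}$, with $|x_1 - x_3| < |x_1 - x_4|$ and $|x_2 - x_4| < |x_2 - x_3|$, the subdifferential inequality reduces to

$W(x_3 - x_4) - W(x_1 - x_2) \ge \langle \eta(x_1 - x_2), x_3 - x_1 \rangle + \langle \eta(x_2 - x_1), x_4 - x_2 \rangle.$

In particular, for $x_3 - x_4 = x_1 - x_2$ we get

$0 \ge \langle \eta(x_1 - x_2) + \eta(x_2 - x_1), x_3 - x_1 \rangle,$

which yields $\eta(x_1 - x_2) = -\eta(x_2 - x_1)$ for any $x_1, x_2 \in \mathbb{R}^d$.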
Remark 4.4. If $\mu \ll \mathcal{L}^d$ we can conclude that

$\int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y) \in \partial \mathcal{W}(\mu)$

for any Borel selection $\eta$ in $\partial W$. Indeed, in this case the set where $\partial W$ is not a singleton is $\mu$-negligible. That is, in the integral we can restrict to the points where $W$ has a gradient (there is no need to select), and in that case $\nabla W * \mu$ belongs to the Wasserstein subdifferential of $\mathcal{W}$ at $\mu$ (it is actually its minimal selection), as shown in [CDFLS].

4.2. The radial case. In the radial case, we are able to give a more explicit characterization 
of the minimal selection of the Wasserstein subdifferential. Before stating our theorem, we 
recall that 

$\partial^0 W(x) = \operatorname{argmin}\{|y| : y \in \partial W(x)\}.$

We have the following

Theorem 4.5. Let $W$ be convex and such that (1.9) holds. Then

(4.3)  $\partial^0 \mathcal{W}(\mu) = (\partial^0 W) * \mu \quad \forall \mu \in \mathscr{P}_2(\mathbb{R}^d).$

Proof. Let $\mu \in \mathscr{P}_2(\mathbb{R}^d)$. Joining together the results of Theorem 3.4 and Corollary 4.2 we know that $\partial^0 \mathcal{W}(\mu)$ has the form of a convolution with an anti-symmetric selection in the subdifferential of $W$. Hence, in order to find the explicit form of $\partial^0 \mathcal{W}(\mu)$, we have to minimize the quantity

$\|\eta * \mu\|^2_{L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)} = \int_{\mathbb{R}^d} \left| \int_{\mathbb{R}^d} \eta(x - y)\, d\mu(y) \right|^2 d\mu(x)$

among all measurable anti-symmetric selections $\eta$ in $\partial W$. Let us introduce some notation. We define, for any $x \in \mathbb{R}^d$ and $v \in \mathbb{R}^d$, the singleton sets

$A_v^+(x) := \{x - v\}, \quad A_v^-(x) := \{x + v\}.$

Note that by antisymmetry of $\eta$, we get

(4.4)  $\int_{A_v^\pm(x)} \eta(x - y)\, d\mu(y) = \pm \mu(\{x \mp v\})\, \eta(v).$



Fix a point v G R , different from the origin. We have 
2 



r](x - y)dfj,(y) 



r](x - y) dfi(y) + 



AZ(x)uA v (x) 



R d \(A+{x)LlA-(x)) 



rj(x - y) dfi(y) 



We suitably expand the square as $(a + b + c)^2 = a^2 + 2a(b + c) + b^2 + 2b(a + c) - 2ab + c^2$. This way, since $A_v^+(x) \cup (\mathbb{R}^d \setminus (A_v^+(x) \cup A_v^-(x))) = \mathbb{R}^d \setminus A_v^-(x)$ and $A_v^-(x) \cup (\mathbb{R}^d \setminus (A_v^+(x) \cup A_v^-(x))) = \mathbb{R}^d \setminus A_v^+(x)$, the right hand side can be rewritten as



AZ(x) 



rj(x - y) dfj,(y) 



+ 2 I r](x-y)dfi(y), \ 

\Ja~(x) Jm 



R d \A~(x) 



r/(x - y)d{i(y) 



A v (x) 



rj(x - y) d[i(y) 



+ 2( / rj(x-y)dn(y), rj(x - y) dfx(y) 

IA+(x) JR d \A+(x) I 



-2( / v(x-y)dfi(y), rj(x - y) dfi(y) ) + R{x) 

\JA+(x) JA~(x) I 

where the remainder term R(x) is given by 



R(x) 



rj(x - y) dn{y) 



lRd\(A+(x)UAv(x)) 

so that it does not depend on the values of rj at ±v. Using (14. 4p . we are left with 



rj(x - y) dfi(y) 



{^At(x))+^(x))) 2 \ V (v)\ 2 



+ 2( ( V (x-y)dfi(y), [ 
\Ja£(x) Jm 



R d \A+(x) 



r](x - y)dfi(y) 



f2( / rj(x -y)d/j,(y), rj(x - y) d/j,(y) ) + R(x) 

lA~(x) JR d \A-(x) / 



Making use of the computed identity, we find 



\\v*t*\\ 



{^At{x))+^A-{x))f d f i(x))\r ] (v)\ 2 + n 



+ 2 



rj(x - y) dfi(y), / rj(x - z) d/i(z) ) dfi(x) 

A+(x) JR d \A+(x) 



+ 2/ I rj(x - y)dn{y), I rj(x - z) dfi(z) ) dfi(x) 

\Ja-(x) JR d \A-(x) I 

{^{At{x))+^A-{x))) 2 dn(x)) \v(v)\ 2 + TZ 



f2 / / (ri(x-y), rj(x - z) dfi(z) ) dfi(x) dfj,(y) 

'{x-y=v} \ J {z£M d :z^x-v} 



+ 2 



{x-y=-v} 



(rj(x-y), I 



{z£R d :z^x+v} 



rj(x - z) d/i(z) ) dn(x) dfx(y), 



where $\mathcal{R} = \int_{\mathbb{R}^d} R(x)\, d\mu(x)$. Notice that the integrals with respect to $z$ can be equivalently taken on the set $\{z \in \mathbb{R}^d : z \ne x, y\}$ in the last two lines. Indeed, in the first one the equality $x - y = v$ implies $\{z \in \mathbb{R}^d : z \ne x - v\} = \{z \in \mathbb{R}^d : z \ne y\}$. Similarly for the second one. Moreover, we can also neglect the set $\{z = x\}$ since $\eta(0) = 0$ by anti-symmetry. We exchange $x$ with $y$ in the last integral, so that



( V(x ~ y), / V(x - z) dfi(z) j dn{x) d/i(y) 

{x-y=-v} \ J{z^x,y} / 



r)(y -x), rj(y- z) dfj,(z) \ dfi(y) dfi(x) 



{y-x=-v} 



( ~v(x -y), V(y~ z ) dfi(z) ) d/i(y) d/j,{x) 

{x-y=v} \ J{z=/=x,y} / 



Hence from the previous computation we get 



= (J (KAt(x))+fi(A~(x))) 2 dfi(x)^j \rt(v)\ 2 +Tl 



(4.5) +2 // / (v(v),v(x- z)-rj(y- z)) dfj,(z)dfj,(x)dfj,(y). 

J J {x— y=v} J {z^=x,y} 

On the set {(x,y) 6 M. d x ]R rf : x — y = v}, we rewrite the interior integral in the following way 
/ (v(v), V(x ~ z) - rj(y - z)) dfi(z) = / (rj(v), t]{x - z) - rj(y - z)) dfi(z) 

J {z=£x,y} J {z=/=x,y,x+v,y-v} 

+ (v(,v),r](2v) - rj(v)) dfl(z) 

J {z=x+v}U{z=y— v} 

= / (v(v), V(x ~ z) - rj(y - z)) dfi(z) 

J {z^=x,y,x+v,y-v} 

+ (KA-(x)) + KAt(y))) ((!?(«), n(2w)> - \r,(v)\ 2 ) 

and thus 



(ri(v),rj(x - z) - rj(y - z)) dfi(z) d/i(x) d/i(y) 

{x-y=v} J{z^x,y} 



{rj(v),ri(x - z) - r){y - z)) d(j,(z) dfi(x) d/j,(y) 

{x—y=v} J {z^x ,y,x+v ,y—v} 



+ 2^y(At(x)UA-(x)))d^x)j ((v(v),v(2v)) - \rj(v)\ 2 ) . 
Here, in the last term we used 

// v(A~ (x)) dfi(x) dfi{y) = // fi{At(y))dn(x)dfi{y) = / fi(A+ (x))fi(AZ (x)) dfi{x) . 

J J {x—y=v} J J {x—y=v} J 

Substituting in (|4.5p we get the final expression 

Wv * vWlw.m = ( f i»(A+(x)) - KA~( X ))) 2 dn(x) ) \v(v)\ 2 + n 



+ 2 I / (rj(v),r](x - z) -r)(y - z)) dfi(z) dfi(x) dfi(y) 

'{x—y=v} J {z^x,y,x+v,y— v} 



(4.6) +4^J^(A+(x)) l x(A-(x)))d l x(x)^ ( V (v),r,(2v)). 
Since $W$ is radial, for all $v \in \mathbb{R}^d$, $v \ne 0$, $\eta(v)$ takes the form

(4.7)  $\eta(v) = \tilde\eta(|v|)\, \frac{v}{|v|},$

where $\tilde\eta$ denotes the generic selection in the subdifferential of the profile function $w$ in (1.9). Convexity implies that $\tilde\eta(|v|) \ge 0$ and thus

(4.8)  $\langle \eta(v), v \rangle = |\eta(v)|\, |v|$

for all $v \in \mathbb{R}^d$. The convexity hypothesis gives

$\langle x - y, \eta(x - z) - \eta(y - z) \rangle \ge 0,$

for any $x, y, z$. Combining this with (4.7), since $\tilde\eta(|v|) \ge 0$, we conclude that

$\langle \eta(x - y), \eta(x - z) - \eta(y - z) \rangle \ge 0.$

From this last inequality it is readily seen that the scalar products appearing in (4.6) are (pointwise) nonnegative. Therefore, since the remainder $\mathcal{R}$ is independent of $\eta(v)$ and $\eta(-v)$, it is clear from (4.6) and (4.8) that $\|\eta * \mu\|^2_{L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)}$ at least does not increase if we choose the value of $\eta$ at $v$ to be the one that minimizes $|\eta(v)|$. By the arbitrariness of $v \in \mathbb{R}^d$ we get (4.3). □

Example 4.6. The above result fails if we omit the convexity assumption. Indeed, let us consider a 1-dimensional example. Let

$W(x) = \frac{1}{2}\, |x^2 - 1|.$

Notice that this function is radial and $(-1)$-convex, and its subdifferential is

$\partial W(x) = x$ for $|x| > 1$, $\quad \partial W(x) = -x$ for $|x| < 1$, $\quad \partial W(x) = [-1, 1]$ for $x = \pm 1$.

Let us consider the measure $\mu = \frac{1}{3}\delta_{x_1} + \frac{1}{3}\delta_{x_2} + \frac{1}{3}\delta_{x_3}$. We have to minimize (up to a positive multiplicative constant) the quantity

$\sum_{j=1}^{3} \left| \sum_{i=1}^{3} \eta(x_j - x_i) \right|^2$

among all measurable antisymmetric selections $\eta$ in $\partial W$.

If we let $x_1 = 1$, $x_2 = 0$, $x_3 = 3/4$, the only points where a selection is needed are $\pm 1$, corresponding to $\pm(x_1 - x_2)$. Hence, expanding the sum above (using the antisymmetry), it is clear that we reduce to finding the minimizer of

$\min\{\eta(x_1 - x_2)^2 + \eta(x_1 - x_2)\,(\eta(x_1 - x_3) - \eta(x_2 - x_3)) : \eta(x_1 - x_2) \in [-1, 1]\}.$

Here $x_1 - x_3 = 1/4$ (so that $\eta(x_1 - x_3) = -1/4$) and $x_2 - x_3 = -3/4$ (so that $\eta(x_2 - x_3) = 3/4$). Then, letting $y = \eta(x_1 - x_2)$, we are left with the problem $\min\{y^2 - y : y \in [-1, 1]\}$, whose solution is $y = 1/2$. This is different from the element of minimal norm in $\partial W(x_2 - x_1)$, which of course is 0.

We also point out that in this non-convex case the choice of the selection is not independent of the measure $\mu$. Indeed, if we change the value of $x_3$ to be, for instance, $-1/4$, we have $\eta(x_1 - x_3) = 5/4$ and $\eta(x_2 - x_3) = -1/4$; then we have to solve $\min\{y^2 + 3y/2 : y \in [-1, 1]\}$, and the solution is $y = \eta(1) = -\eta(-1) = -3/4$.
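The two minimizations in Example 4.6 can be reproduced with a short computation. The sketch below is only illustrative: it assumes $W(x) = \frac{1}{2}|x^2 - 1|$, uniform weights $1/3$, and a grid search over the free value $y = \eta(1) \in [-1, 1]$; the function names are hypothetical. Exact rational arithmetic is used so that the grid minimizer is not affected by rounding.

```python
from fractions import Fraction as F


def eta(z, y):
    """Antisymmetric selection in the subdifferential of W(x) = |x^2 - 1| / 2.

    The only freedom is the value y = eta(1) in [-1, 1]; eta(-1) = -y.
    """
    if z == 1:
        return y
    if z == -1:
        return -y
    if abs(z) > 1:
        return z
    return -z  # |z| < 1, which also gives eta(0) = 0


def squared_norm(points, y):
    """Squared L^2(mu) norm of eta * mu for mu uniform on the given points."""
    m = F(1, len(points))
    return sum(
        m * sum(m * eta(xi - xj, y) for xj in points) ** 2
        for xi in points
    )


def best_selection(points, grid=100):
    """Grid minimizer of the squared norm over y in [-1, 1] (step 1/grid)."""
    candidates = [F(k, grid) for k in range(-grid, grid + 1)]
    return min(candidates, key=lambda y: squared_norm(points, y))
```

For the two configurations of the example one would call `best_selection([F(1), F(0), F(3, 4)])` and `best_selection([F(1), F(0), F(-1, 4)])`; under the stated assumptions the minimizers depend on the position of the third particle.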
Example 4.7. The result of Theorem 4.5 fails if we omit the radial hypothesis on $W$. As a counterexample we provide a convex function $W$ satisfying all the assumptions (1.2), (1.3), (1.4), (1.8) and a measure $\mu \in \mathscr{P}_2(\mathbb{R}^2)$ such that

$\partial^0 \mathcal{W}(\mu) \ne (\partial^0 W) * \mu.$
Let the graph of $W$ be a pyramid with vertex in the origin, with varying slopes, given by

$\nabla W(x,y) = (1, 0)$ for $0 < x < 1$, $-x < y < x$, $\qquad \nabla W(x,y) = (\theta, 0)$ for $x > 1$, $-x < y < x$,

where $\theta > 2$. $W$ is then defined by symmetry, such that its level sets are squares centered in the origin. Let $\mu = \frac{1}{3}\delta_{x_1} + \frac{1}{3}\delta_{x_2} + \frac{1}{3}\delta_{x_3}$, where

$x_2 - x_1 = (1, 1), \quad x_3 - x_2 = (-1/2 - \varepsilon, 1/2), \quad x_3 - x_1 = (1/2 - \varepsilon, 3/2).$

Among these points, for small enough $\varepsilon > 0$, $\partial W$ is not a singleton only at $x_2 - x_1$, and in particular it is the convex set $K$ of $\mathbb{R}^2$ defined as

$K = \{(x, y) \in \mathbb{R}^2 : x \ge 0,\ y \ge 0,\ 1 - x \le y \le \theta - x\}.$

We let $\eta_{ij}$, $i, j = 1, 2, 3$, denote the generic element of $\partial W$ at $x_i - x_j$. In this particular case

$\partial^0 \mathcal{W}(\mu) = \operatorname{argmin} \left\{ \frac{1}{27} \sum_{i=1}^{3} \Bigl| \sum_{j \ne i} \eta_{ij} \Bigr|^2 \right\}.$

Since $\eta_{13} = \nabla W(x_1 - x_3)$, $\eta_{23} = \nabla W(x_2 - x_3)$, we are left to minimize with respect to the unique variable $\eta_{21}$, that is, the minimization problem above reduces to

$\frac{1}{27}\, |-\eta_{21} + \eta_{13}|^2 + \frac{1}{27}\, |\eta_{23} + \eta_{21}|^2 + \frac{1}{27}\, |\eta_{32} + \eta_{31}|^2.$

We have $\eta_{13} = (0, -\theta)$ and $\eta_{23} = (1, 0)$, and hence it is immediate to check that the solution is the minimizer of

(4.9)  $|\eta_{21}|^2 + \langle \eta_{21}, (1, \theta) \rangle.$

If $\eta = \partial^0 W$ we have that $\eta_{21}$ is the element of minimal norm in $K$, that is $(1/2, 1/2)$, and in this case the quantity above takes the value $1 + \theta/2$. But $\theta > 2$, so that the quadratic expression (4.9) takes the strictly smaller value 2 at the point $\eta_{21} = (1, 0) \in K$; hence its minimum in $K$ is attained at a different point.
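The comparison at the end of Example 4.7 can be checked numerically. The following is a sketch under explicit assumptions: the reduced cost is taken to be $|\eta_{21}|^2 + \langle \eta_{21}, (1, \theta) \rangle$, the sample value $\theta = 3$ is chosen only for illustration, and the minimization over $K$ is a plain grid search with a hypothetical step $1/20$.

```python
from fractions import Fraction as F


def reduced_cost(a, b, theta):
    """Reduced cost |eta21|^2 + <eta21, (1, theta)> with eta21 = (a, b)."""
    return a * a + b * b + a + theta * b


def minimize_over_K(theta, grid=20):
    """Grid search over K = {a >= 0, b >= 0, 1 <= a + b <= theta}.

    Returns (minimal cost, minimizing grid point); illustrative only.
    """
    best = None
    steps = int(theta * grid)
    for i in range(steps + 1):
        for j in range(steps + 1):
            a, b = F(i, grid), F(j, grid)
            if 1 <= a + b <= theta:
                cost = reduced_cost(a, b, theta)
                if best is None or cost < best[0]:
                    best = (cost, (a, b))
    return best


theta = F(3)                                   # assumed sample value, theta > 2
cost_min_norm = reduced_cost(F(1, 2), F(1, 2), theta)  # cost at the minimal-norm point
cost_best, point_best = minimize_over_K(theta)
```

For this sample value the cost at the minimal-norm element $(1/2, 1/2)$ is $1 + \theta/2$, while the grid minimizer sits on the boundary of $K$ at a different point, which is the mechanism behind the counterexample.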



5. Particle system 

As in [CDFLS], the well-posedness result for measure solutions in Theorem 2.3 allows us to put particle and continuum solutions in the same framework. Assume that we are given $N$ pointwise particles, each carrying a mass $m_i$, with $\sum_i m_i = 1$. Let $x_i(t)$ be the position in $\mathbb{R}^d$ of the $i$-th particle at time $t$. Let $\eta^1(x,y) : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}^d$ and $\eta^2(x,y) : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}^d$ be selections in the subdifferentials $\partial_1 W$ and $\partial_2 W$ respectively. We consider the system

= ^^2 m j( r 1 1 ( x j ~ Xi) + rf{xi- Xj)), i = l,...,N. 




If the %{ are absolutely continuous curves, the empirical measure n(t) = Y2i=i m Ai(t) 
the PDE 



solves 



d_ 

di 



1 



div 



0. 



Among these selections, we have the minimal one in $\partial \mathcal{W}(\mu)$, hence we have a correspondence between our equation (2.5) and the ODE system above. In particular, if we are in the framework of assumption (1.8), the velocity vector field of the continuity equation is written as a convolution and the corresponding ODE system takes the form

(5.1)  $\frac{dx_i}{dt} = -\sum_{j=1}^{N} m_j\, \eta_{ij}, \quad i = 1, \ldots, N,$

where $\eta_{ij} = \eta(x_i - x_j)$ and $\eta$ is the suitable measurable anti-symmetric selection in $\partial W$.



Remark 5.1 (Characterization of the element of minimal norm in the radial convex 
case for particles). In the particular case of a system of particles, it is more immediate to 
see how the proof of Theorem 14.51 works. Actually, it was the origin of the idea in how to 
get the proof in previous section. We include it since we believe it is an instructive proof. 
Suppose that W is convex, so that the minimal selection in <9W(/u) is expected, after Theorem 
3J to be d°W * fj,. In fact, we have in this case 



/v 



},mi5 Xi (x), m 1 + ...+m N = l, 



i=l 



where X{ £ After Theorem 13. 44 we know that we have to search for the Borel antisym- 
metric selection r\ in the subdifferential of W that minimizes the L 2 (M rf , /x; M. d ) norm of rj * \i. 
We have 



I*7*mIIl2 ( 



N 

E 



mi 



N 



y^ j m i r](xj,x i 



i=i 



For simplicity we are letting again rj(xj,Xi) = r)^. Hence we have to solve the problem 





c 

N 


TV 


2 


min < 






: rj G dW, rj^ 






i=l 





Vi 



Suppose first that W is differentiable in all the points xj — Xi- In this case we have rj 



VW{x 3 



and the quantity to minimize simply takes the (single) value 



TV 



TV 



miVw(xj — Xi) 



i=l 



which is then the square norm of the minimal selection in dW. Otherwise, taking advantage 
of the anti-symmetry of each selection, let us rearrange the terms highlighting r]jj, where / 
and J are fixed nonequal numbers between 1 and N, and let xj — x\ = xq. In order to keep 
simple, we make the assumption that no other couple is such that This 
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way, it is not difficult to see that, expanding the squares, we can write 



N 



(5.2) ^m, 



N 



i=l 



N 



m I mj(m I +m,j)\rij I \ 2 +2rri I rrij ^ m k (rjjj, (rj Jk - rj Ik ))+R, 



fc=i 

fc^J, I 



where $R$ does not depend on the value of $\eta$ at $\pm x_0$. We claim that each scalar product in the sum above is nonnegative, since by convexity of $W$, any selection $\eta_{ij}$ in $\partial W(x_i - x_j)$ is monotone in the sense that

$\langle \eta_{jk} - \eta_{ik}, x_j - x_k - x_i + x_k \rangle = \langle \eta_{jk} - \eta_{ik}, x_j - x_i \rangle \ge 0.$

By assumption v), $W$ is radial ($W(\cdot) = w(|\cdot|)$), and if $\tilde\eta_{ji}$ is a selection in $\partial w(|x_j - x_i|)$, then $\tilde\eta_{ji}$ is nonnegative by convexity of $W$ and

$\langle \eta_{ji}, \eta_{jk} - \eta_{ik} \rangle = \frac{\tilde\eta_{ji}}{|x_j - x_i|}\, \langle x_j - x_i, \eta_{jk} - \eta_{ik} \rangle \ge 0 \quad \forall i, j, k.$

This shows that the claim is correct. Then if we minimize in (5.2) only with respect to the admissible values of $\eta_{JI}$, we see that the minimum argument is found if we take $\eta_{JI} = \partial^0 W(x_J - x_I)$ (and the same for $\eta_{IJ}$, by anti-symmetry). But by construction, such minimum argument does not depend on the value of $\eta$ in the other points. We conclude that if $\eta$ is a solution of the full minimization problem, then $\eta_{JI}$ is forced to be the element of minimal norm in $\partial W(x_J - x_I)$. By the arbitrariness of $I, J$ we conclude: the solution is $\partial^0 W * \mu$.

Remark 5.2 (Collapse). In the radial-convex case, the finite-time collapse argument in 
[CDFLSj carries over. Indeed, it is enough to substitute VW therein with the new object 
d°W. Consider the equation (|5.ip for a system of N particles. We notice that, since d°W 
is anti-symmetric, the center of mass Ylj=i N x j does not move during the evolution. We 
denote such point by x$. Then, assuming for simplicity that the weights are uniform, there 
holds 

d\X% Xo) 1 V - ^ r\n f I \\ Xi Xj 

T+ = at d w (\xi — Xj\)- J - 

dt N ' \xi — Xj\ 

Multiplying by x^ — xq we find 

/r o\ Id. .0 1 OQ ,, ,x \Xi — Xj, Xi — xq) 
(5.3) - — \Xi — Xn \ = > O W{ \Xi — Xj ) : — : . 

v ' 2dV 1 N ^ u 0V Xi-xA 

Now, let R(t) := maxjg.n ,...,N}{\ x i(t) — xo\}. If / 6 {1, . . . , N} is such that \xi(t) — xq\ = R(t), 
at that time we clearly have {xi — Xj,xi — xq) > 0. For any time and for any I with this 
property, from (|5.3p we see that the derivative of \xi — xq\ 2 is nonpositive, so that its maximum 
governs the evolution of R: 

1 d 2 _ 1 A (Xi- Xj,Xi- x ) 



-^-R 2 (t) = max — i d°w(\xi — 
2dt w {i-.Xi(t)=R(t)} N ^-f Vl J|; Xi-So 

1 n A, 

- { ,4)= x ^)} " n mtT) g < Si " ^ " Xo> 

< max — — \Xi — Xq\ = —R(t), 

~ {i-.x,(t)=R(t)} N2R{t) 1 U| 2N w ' 
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where 0, = mfm+oo) Here we used the elementary bound \xi(t) — Xj(t)\ < 2R(t) and 

the fact that Ylj=i( x i ~ x j> x i) = \ x i ~ x o\ 2 if ^ € {1, . . . , N} is such that = R(t). 

The solution then reaches the asymptotic state /Xoo = 5 Xo in finite time if £1 > 0. We can 
also generalize the result to the case O = under suitable assumptions on the behavior of u 
at the origin such that a Gronwall estimate can be achieved. For instance if we ask - w ^ 
to be decreasing on some interval (0, e) we are done. For all the details and a more general 
discussion on this issue we refer to [CDFLS] . We only remark that if we have finite time 
collapse in the center of mass for a system of particles, by the general stability properties 
of gradient flow solutions we can also deduce the finite time collapse for general compactly 
supported initial data. 
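The collapse mechanism of Remark 5.2 can be illustrated with a small simulation. This is a minimal sketch, not the scheme of [CDFLS]: it assumes the one-dimensional convex radial potential $W(x) = |x|$ (so that $\partial^0 W = \operatorname{sign}$, with value 0 at the origin), uniform weights, the particle system written as $\dot x_i = -\frac{1}{N}\sum_j \partial^0 W(x_i - x_j)$, and an explicit Euler discretization whose step size and final time are arbitrary choices.

```python
def sign(z):
    """Minimal selection of the subdifferential of |x| (value 0 at the origin)."""
    return (z > 0) - (z < 0)


def simulate(positions, dt=1e-3, t_final=3.0):
    """Explicit Euler steps for dx_i/dt = -(1/N) * sum_j sign(x_i - x_j)."""
    xs = list(positions)
    n = len(xs)
    for _ in range(int(round(t_final / dt))):
        velocities = [-sum(sign(xi - xj) for xj in xs) / n for xi in xs]
        xs = [xi + dt * vi for xi, vi in zip(xs, velocities)]
    return xs


def center_and_radius(xs):
    """Center of mass (uniform weights) and largest distance from it."""
    center = sum(xs) / len(xs)
    return center, max(abs(x - center) for x in xs)


start = [-1.0, -0.2, 0.5, 0.7]
center0, radius0 = center_and_radius(start)
center1, radius1 = center_and_radius(simulate(start))
```

Because the selection is anti-symmetric, the velocities sum to zero at every step, so the discrete center of mass is preserved up to rounding. The radius shrinks to the order of the time step, which is as close to collapse as an explicit scheme with a discontinuous velocity can be expected to get.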

6. Examples of asymptotic states in the non convex case 

In this section, suppose that assumptions (1.8) and (1.9) hold. We have already seen in the case of particle collapse that, for convex nonnegative potentials, there is the asymptotic state consisting of a single Dirac mass. It is easily seen that any Dirac mass minimizes the functional $\mathcal{W}$. So, in this case, by the translation invariance property, the only asymptotic solution is in fact the Dirac mass at the center of mass of the initial datum.

In the repulsive-attractive case, one can expect a much bigger variety of stationary solutions. This is indeed the case, as proved in 1D in [FR1, FR2], where the authors show that the set of stationary states for short-range repulsive long-range attractive potentials can be very large and complicated. Moreover, they give examples in which the stationary states are composed of a finite number of Dirac deltas at points and others in which one has integrable compactly supported stationary solutions. They also show that whether the stationary states are integrable or not depends on how strong the repulsion is at the origin. Some numerical computations indicate that this is also the case in more dimensions [KSUB]. They show that the set of stable stationary states can be large and with complicated supports arising from instability modes of the uniform distribution on a sphere. Finally, in [BCLR] we have studied the stability for radial perturbations of the uniform distribution on a suitable sphere for general radial repulsive-attractive potentials. We will show a related example below.

In this section we are going to present some examples, based on the available characterization of the velocity vector field of the continuity equation, of properties of the set of stationary solutions. In particular, we will consider the $(-1)$-convex potentials in one dimension given by

$W(x) = \frac{1}{2}\, |x^2 - 1|^2, \qquad \widetilde W(x) = \frac{1}{2}\, |x^2 - 1|.$

Both cases correspond to attractive-repulsive potentials with the same behavior in the repulsive part at the origin. However, the change from repulsive to attractive is smooth in one case and only Lipschitz in the other. In fact, in the two cases there are many analogies, but also some different behaviors, as we are going to show with the next propositions. First of all, we search for stationary states made by a finite number of particles.

Proposition 6.1. There exist two-particle stationary states for $\mathcal{W}$ and $\widetilde{\mathcal{W}}$.

Proof. Let $\mu = m_1 \delta_{x_1} + m_2 \delta_{x_2}$, with $m_1 + m_2 = 1$. For the functional $\mathcal{W}$ we have to impose the condition

(6.1)  $\frac{dx_i}{dt} = m_j \nabla W(x_j - x_i) = 0,$

for $i, j = 1, 2$, $i \ne j$. But $\nabla W(x) = 2x(x^2 - 1)$. Hence it is enough to choose $x_1, x_2$ such that $|x_1 - x_2| = 1$, independently of the weights. We see that we have infinitely many stationary states made by two Dirac deltas. In the case of $\widetilde W$, the subdifferential is

(6.2)  $\partial \widetilde W(x) = x$ if $|x| > 1$, $\quad \partial \widetilde W(x) = -x$ if $|x| < 1$, $\quad \partial \widetilde W(x) = [-1, 1]$ if $|x| = 1$.

In order to find a stationary state, we have to solve (6.1) with $\eta$ in place of $\nabla W$, where $\eta$ is the suitable anti-symmetric selection in $\partial \widetilde W$ realizing the minimal norm in $\partial \widetilde{\mathcal{W}}(\mu)$ (see Corollary 4.2). Such a selection is then found by minimizing the quantity

$\|\eta * \mu\|^2_{L^2(\mathbb{R}, \mu; \mathbb{R})} = \sum_{j=1}^{2} m_j \Bigl| \sum_{i=1}^{2} m_i\, \eta(x_j - x_i) \Bigr|^2 = m_1 m_2 (m_1 + m_2)\, (\eta(x_1 - x_2))^2$

among the admissible selections $\eta$ (we are using the anti-symmetry). Let again $|x_2 - x_1| = 1$, so that it is clear that the minimum above is zero, attained for $\eta(1) = -\eta(-1) = 0$. In this way, the two equations (6.1) are still satisfied. □
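As a quick sanity check of the two-particle computation, the sketch below evaluates the right-hand side of (6.1) for the smooth potential with $\nabla W(x) = 2x(x^2 - 1)$ and the norm $m_1 m_2 (m_1 + m_2)\, y^2$ for the Lipschitz one, where $y = \eta(x_1 - x_2) \in [-1, 1]$ is the free selection value. The weights and positions are arbitrary sample values and the function names are hypothetical.

```python
def grad_W(x):
    """Gradient of the smooth potential, as stated in the proof: 2x(x^2 - 1)."""
    return 2.0 * x * (x * x - 1.0)


def velocities_smooth(m1, m2, x1, x2):
    """Right-hand sides of (6.1): dx_i/dt = m_j * grad_W(x_j - x_i)."""
    return m2 * grad_W(x2 - x1), m1 * grad_W(x1 - x2)


def norm_sq_nonsmooth(m1, m2, y):
    """Squared norm of eta * mu when |x1 - x2| = 1 and eta(x1 - x2) = y."""
    return m1 * m2 * (m1 + m2) * y * y


def best_selection_value(m1, m2, grid=100):
    """Grid minimizer of the squared norm over y in [-1, 1]."""
    candidates = [k / grid for k in range(-grid, grid + 1)]
    return min(candidates, key=lambda y: norm_sq_nonsmooth(m1, m2, y))
```

For instance, `velocities_smooth(0.3, 0.7, 0.25, 1.25)` places the particles at distance 1 with unequal masses, and `best_selection_value(0.3, 0.7)` returns the selection value with vanishing velocity.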



Proposition 6.2. For both functionals $\mathcal{W}$ and $\widetilde{\mathcal{W}}$, there are no absolutely continuous stationary states in one dimension.

Proof. Let us consider the functional $\widetilde{\mathcal{W}}$. The argument is based on the fact that, if $\mu$ is absolutely continuous, it does not charge the points of non-differentiability of $\widetilde W$. For a measure $\mu$ to be stationary, we have to verify that the corresponding velocity vector field vanishes. That is, $\eta * \mu = 0$, where $\eta$ is the usual optimal selection in $\partial \widetilde W$, as in Corollary 4.2. Suppose that $\mu$ is a stationary state and that $\mu = \rho \mathcal{L}^1$, for some $\rho \in L^1(\mathbb{R})$; then

$\int_{\{|x - y| > 1\}} (x - y)\rho(y)\, dy - \int_{\{|x - y| < 1\}} (x - y)\rho(y)\, dy = 0.$

By the translation invariance property, we can fix without loss of generality the center of mass, so we let $\int_{\mathbb{R}} y\rho(y)\, dy = 0$. We deduce

$2x \int_{\{|x - y| > 1\}} \rho(y)\, dy - x = 2 \int_{\{|x - y| > 1\}} y\rho(y)\, dy,$

hence

$2x \left( \int_{-\infty}^{x-1} \rho(y)\, dy + \int_{x+1}^{+\infty} \rho(y)\, dy \right) - x = 2 \left( \int_{-\infty}^{x-1} y\rho(y)\, dy + \int_{x+1}^{+\infty} y\rho(y)\, dy \right).$

Let us denote the term in the parenthesis in the left side by Q(x) and let us take the derivative 
with respect to x. We have 

29(x) + 2x(p(x - 1) - p(x + 1)) - 1 = 2((x - l)p(x - 1) - (x + l)p(x + 1)), 

which yields 26(x) - 1 = -2(p(x - 1) - p(x + 1)), that is Q'(x) = -Q(x) + \. We find 

@(x) = ke~ x + -, keR, 

then p(x — 1) — p(x + 1) = —ke~ x . But the integral of p is 1, so k = and we are left with 
p(x — 1) = p(x + 1). This is a contradiction, since p can not be periodic in this case. The 
proof for W is analogous, we omit the details. □ 
The following are more examples of stationary states.

Example 6.3. There are stationary states for the non-smooth functional $W$ of the form

$$\mu = m_1\delta_{x_1} + m_2\delta_{x_2} + m_3\delta_{x_3},$$

with $m_1 + m_2 + m_3 = 1$. Indeed, we have to verify that

(6.3) $$\begin{cases} \dfrac{dx_1}{dt} = m_2\eta(x_2 - x_1) + m_3\eta(x_3 - x_1) = 0,\\[4pt] \dfrac{dx_2}{dt} = m_1\eta(x_1 - x_2) + m_3\eta(x_3 - x_2) = 0,\\[4pt] \dfrac{dx_3}{dt} = m_1\eta(x_1 - x_3) + m_2\eta(x_2 - x_3) = 0, \end{cases}$$

where, as usual, $\eta$ represents the anti-symmetric selection in the subdifferential (6.2) given
by Corollary 4.2.

For instance, let us search for a solution in the following range

(6.4) $$x_2 - x_1 = 1, \quad x_3 - x_1 > 1, \quad 0 < x_3 - x_2 < 1.$$

We begin searching for the right selection. We use the notation $\eta_{ij} := \eta(x_i - x_j)$. Clearly we
need to select only at $\pm(x_2 - x_1)$. Taking the anti-symmetry into account, and recalling that
the subdifferential of $W$ is (6.2) and that the relations (6.4) hold, we have

$$\begin{aligned} \|\eta * \mu\|^2_{L^2(\mu;\mathbb{R})} &= \sum_{i=1}^{3} m_i \Big| \sum_{j=1}^{3} m_j \eta(x_j - x_i) \Big|^2 \\ &= m_1\big(m_2^2\eta_{12}^2 + 2m_2m_3(x_1 - x_3)\eta_{12}\big) + m_2\big(m_1^2\eta_{21}^2 + 2m_1m_3(x_3 - x_2)\eta_{21}\big) + R_1 \\ &= m_1m_2(m_1 + m_2)\eta_{12}^2 + 2m_1m_2m_3(x_1 + x_2 - 2x_3)\eta_{12} + R_2, \end{aligned}$$

where the remainders $R_1, R_2$ do not depend on the value of $\eta$ at $\pm 1$, hence we only have to
minimize with respect to $\eta_{12}$ on the interval $[-1, 1]$. We have a quadratic function, so that if
the vertex of the parabola lies at or to the right of the right endpoint of the interval $[-1, 1]$, then the minimizer is found
for $\eta_{12} = 1$, hence we get $\eta(-1) = 1$ and then, by anti-symmetry, $\eta(1) = -1$. Computing
the vertex position, this condition is

(6.5) $$\frac{m_3(x_3 - x_2) + m_3(x_3 - x_1)}{m_1 + m_2} \ge 1.$$

Hence, if such a condition holds, making use of (6.2), the first two equations in (6.3) reduce to

$$\begin{cases} -m_2 + m_3(x_3 - x_1) = 0,\\ m_1 - m_3(x_3 - x_2) = 0. \end{cases}$$



We deduce

$$x_3 - x_1 = \frac{m_2}{m_3} \quad \text{and} \quad x_3 - x_2 = \frac{m_1}{m_3},$$

and since $1 = x_2 - x_1 = (x_3 - x_1) - (x_3 - x_2)$, we have

$$m_3 = m_2 - m_1.$$

We may pose, for $0 < a < \frac{1}{4}$,

$$m_1 = \frac{1}{4} - a, \quad m_2 = \frac{1}{2}, \quad m_3 = \frac{1}{4} + a.$$



Then we find

(6.6) $$x_3 - x_1 = \frac{2}{1 + 4a}, \quad x_3 - x_2 = \frac{1 - 4a}{1 + 4a}.$$

This way, the constraints in (6.4) are satisfied. Moreover, it is immediate to check that (6.3)
is solved and that

$$\frac{m_3(x_3 - x_2) + m_3(x_3 - x_1)}{m_1 + m_2} = 1,$$

so that (6.5) is verified and the computation is indeed consistent.

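In order to find the three points, we can for instance fix the center of mass at the origin:

$$\frac{1 - 4a}{4}\,x_1 + \frac{1}{2}\,x_2 + \frac{1 + 4a}{4}\,x_3 = 0.$$

Together with (6.6), this gives

$$x_1 = -1, \quad x_2 = 0, \quad x_3 = \frac{1 - 4a}{1 + 4a}.$$

We conclude that, for $0 < a < \frac{1}{4}$,

$$\mu = \frac{1 - 4a}{4}\,\delta_{-1} + \frac{1}{2}\,\delta_{0} + \frac{1 + 4a}{4}\,\delta_{\frac{1-4a}{1+4a}}$$

is a stationary state.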
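
The computation of Example 6.3 can be verified in exact arithmetic. The sketch below is only an illustration under stated assumptions: the helper names (`eta`, `velocities`, `energy`, `vertex`, `three_particle_state`) are hypothetical, and the parameter `eta12` stands for the value $\eta(-1)$ of the selection, with $\eta(1) = -\eta_{12}$ by anti-symmetry.

```python
from fractions import Fraction

def eta(z, eta12):
    """Selection in (6.2): eta(-1) = eta12 and eta(1) = -eta12 (anti-symmetry)."""
    if abs(z) > 1:
        return z
    if abs(z) < 1:
        return -z
    return -eta12 if z == 1 else eta12

def velocities(xs, ms, eta12):
    """Right-hand sides of system (6.3) for a given selection value eta12."""
    n = len(xs)
    return [sum(ms[j] * eta(xs[j] - xs[i], eta12) for j in range(n) if j != i)
            for i in range(n)]

def energy(xs, ms, eta12):
    """sum_i m_i |sum_j m_j eta(x_j - x_i)|^2, the quantity minimized over eta12."""
    return sum(m * v * v for m, v in zip(ms, velocities(xs, ms, eta12)))

def vertex(xs, ms):
    """Left-hand side of condition (6.5)."""
    return ms[2] * (2 * xs[2] - xs[0] - xs[1]) / (ms[0] + ms[1])

def three_particle_state(a):
    """Weights and positions of Example 6.3 for a parameter 0 < a < 1/4."""
    a = Fraction(a)
    ms = [Fraction(1, 4) - a, Fraction(1, 2), Fraction(1, 4) + a]
    xs = [Fraction(-1), Fraction(0), (1 - 4 * a) / (1 + 4 * a)]
    return xs, ms

xs, ms = three_particle_state(Fraction(1, 8))
print(velocities(xs, ms, 1))   # all three velocities with the selection eta12 = 1
print(vertex(xs, ms))          # value of the ratio in (6.5)
print([energy(xs, ms, Fraction(k, 4)) for k in range(-4, 5)])
```

Scanning `eta12` over $[-1, 1]$ shows that the energy is minimized at the right endpoint, in agreement with the choice $\eta(-1) = 1$ made in the example.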

Remark 6.4. In the case of the smooth functional, it is very easy to construct an analogous example by solving
system (6.3), where this time the actual gradient of $W$ appears. In both cases, it seems clear
that the procedure can be repeated to find infinitely many stationary states with $N$ Dirac
masses for any $N > 3$.

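We conclude with an example in two space dimensions. The reference functional is simply
the radial version of $W$, still denoted by $W$, that is

$$W(x) = \frac{1}{2}\big| |x|^2 - 1 \big|, \quad x \in \mathbb{R}^2.$$

The functional is still $(-1)$-convex, and in this case

$$\partial W(x) = \begin{cases} x & \text{if } |x| > 1,\\ -x & \text{if } |x| < 1,\\ \{\lambda x : \lambda \in [-1, 1]\} & \text{if } |x| = 1. \end{cases}$$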
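
The $(-1)$-convexity claim amounts to the convexity of $x \mapsto W(x) + \frac{1}{2}|x|^2 = \max\{|x|^2 - \frac{1}{2}, \frac{1}{2}\}$. As a rough numerical sanity check (a sketch only, not a proof), one can test midpoint convexity of this shifted function on random pairs of points; the sampling box $[-2,2]^2$, the seed, and the function names are arbitrary assumptions.

```python
import random

def W(x):
    """Radial potential W(x) = 0.5 * | |x|^2 - 1 | on R^2."""
    return 0.5 * abs(x[0] ** 2 + x[1] ** 2 - 1.0)

def shifted(x):
    """W(x) + 0.5 |x|^2, which is convex if W is (-1)-convex."""
    return W(x) + 0.5 * (x[0] ** 2 + x[1] ** 2)

def midpoint_gap(x, y):
    """Convexity gap at the midpoint; nonnegative for a convex function."""
    mid = ((x[0] + y[0]) / 2, (x[1] + y[1]) / 2)
    return 0.5 * shifted(x) + 0.5 * shifted(y) - shifted(mid)

rng = random.Random(0)  # fixed seed so that the check is reproducible
samples = [((rng.uniform(-2, 2), rng.uniform(-2, 2)),
            (rng.uniform(-2, 2), rng.uniform(-2, 2))) for _ in range(10000)]
worst_gap = min(midpoint_gap(x, y) for x, y in samples)
print(worst_gap)
```

A worst gap that is nonnegative up to rounding is consistent with the convexity of the shifted function.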

Example 6.5. Let $\sigma_R$ denote the uniform measure on the circumference $\partial B_R(0)$, of radius
$R$, centered at the origin. There exists $R > 0$ such that the measure $\sigma_R$ is a stationary state
for functional $W$.

Indeed, we can show that for any $x \in \partial B_R(0)$ there holds $\eta * \sigma_R = 0$ for a suitable choice
of the radius $R$, $\eta$ being the optimal selection of Corollary 4.2. Explicitly, the convolution is

$$\int_{\{|x-y|>1\}} (x - y)\,d\sigma_R(y) - \int_{\{|x-y|<1\}} (x - y)\,d\sigma_R(y),$$

and the set of points $\{y : |x - y| = 1\}$, where one should select, is negligible. Fix $x$ on the
circle. We let $(e_1, e_2)$ be an orthonormal basis of $\mathbb{R}^2$, where $e_1$ is the direction of $x$, so that
$x = Re_1$. Hence we have to solve

$$Re_1\,\sigma_R(\{|x-y|>1\}) - Re_1\,\sigma_R(\{|x-y|<1\}) - \int_{\{|x-y|>1\}} y\,d\sigma_R(y) + \int_{\{|x-y|<1\}} y\,d\sigma_R(y) = 0.$$

We write the integrals in polar coordinates with respect to the origin and the vector $e_1$. In this
system, we let $\alpha = \alpha(R)$ denote the (positive) angle corresponding to the intersection point
between $\partial B_R(0)$ and the circle of radius 1 centered at $x$ (see Figure 1 below). In particular

(6.7) $$\sin\alpha(R) = \frac{\sqrt{R^2 - \frac{1}{4}}}{R^2}.$$

Since $y = R\cos\theta\,e_1 + R\sin\theta\,e_2$, it is immediately seen, by oddness of the sine function, that
the equation in the direction of $e_2$ is identically satisfied. In the direction of $e_1$ we find

$$2R\int_{\alpha}^{\pi} R\,d\theta - 2R\int_{0}^{\alpha} R\,d\theta - 2R\int_{\alpha}^{\pi} \cos\theta\,R\,d\theta + 2R\int_{0}^{\alpha} \cos\theta\,R\,d\theta = 0.$$

Hence we have to solve

$$f(R) := \pi - 2\alpha(R) + 2\sin\alpha(R) = 0.$$

If $R < \frac{1}{2}$ the computation does not make sense, but indeed we cannot have a stationary
state for $R < \frac{1}{2}$, since in this case any point in $\partial B_R(0)$ has distance lower than 1 from $x$, so
that for each of them the effect on $x$ is a repulsion, and $x$ tends to move away from the origin.
Taking (6.7) into account, if $R = \frac{1}{2}$ we have $\alpha(R) = \pi$, hence the value of $f$ at $\frac{1}{2}$ is $-\pi$. As
$R$ increases from $\frac{1}{2}$ to $+\infty$, the angle $\alpha(R)$ decreases from $\pi$ to 0. Notice that the function

$$R \mapsto \sin\alpha(R) = \frac{\sqrt{R^2 - \frac{1}{4}}}{R^2}$$

is increasing from $\frac{1}{2}$ to $\frac{\sqrt{2}}{2}$, where it has its maximum, and is decreasing in $(\frac{\sqrt{2}}{2}, +\infty)$. On the
other hand, $\sin(x) - x$ is a decreasing function. Since $f(\sqrt{2}/2) = 2$, we conclude that $f$ has
only one zero, found in the interval $(1/2, \sqrt{2}/2)$. If $R_0$ is the zero, for $R = R_0$ the measure
$\sigma_R$ is stationary.
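
The radius $R_0$ can be approximated numerically. The following sketch is an illustration under stated assumptions and not part of the original argument: it uses the half-angle relation $\sin(\alpha/2) = 1/(2R)$, which follows from $|x - y| = 2R\sin(\theta/2)$ and is consistent with (6.7), a plain bisection on $f$, and a midpoint-rule quadrature of the $e_1$ component of $\eta * \sigma_R$ with respect to arc length. All function names and tolerances are hypothetical choices.

```python
import math

def alpha(R):
    """Angle where the unit circle centered at x = R e1 meets the circle of radius R.

    Obtained from |x - y| = 2 R sin(theta / 2) = 1; requires R >= 1/2.
    """
    return 2.0 * math.asin(1.0 / (2.0 * R))

def f(R):
    """f(R) = pi - 2 alpha(R) + 2 sin(alpha(R)), as in the text."""
    a = alpha(R)
    return math.pi - 2.0 * a + 2.0 * math.sin(a)

def bisect(func, lo, hi, tol=1e-12):
    """Plain bisection, assuming func(lo) < 0 < func(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if func(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def e1_component(R, n=100000):
    """Midpoint-rule approximation of the e1 component of eta * sigma_R at x = R e1."""
    h = 2.0 * math.pi / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * h
        dx = R - R * math.cos(theta)      # (x - y) . e1
        dy = -R * math.sin(theta)         # (x - y) . e2
        sign = 1.0 if math.hypot(dx, dy) > 1.0 else -1.0
        total += sign * dx * R * h        # arc-length element R d(theta)
    return total

R0 = bisect(f, 0.5, math.sqrt(2.0) / 2.0)
print(R0, f(R0), e1_component(R0))
```

The quadrature is only first-order accurate because of the jump of the selection across $\{|x - y| = 1\}$, so it serves as a consistency check on $f$ rather than as a precise evaluation.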



7. Appendix: vector and plan subdifferential 

Here we give a more complete overview about the Wasserstein subdifferential. In [AGS,
§10.3], the theory is developed for functionals $\Phi : \mathcal{P}_2(\mathbb{R}^d) \to (-\infty, +\infty]$ such that

(7.1) $\Phi : \mathcal{P}_2(\mathbb{R}^d) \to (-\infty, +\infty]$ is proper and lower semicontinuous in $\mathcal{P}_2(\mathbb{R}^d)$

and

(7.2) $\Phi(\cdot) + \frac{1}{2\tau} d_W^2(\cdot, \mu)$ admits minimizers for any small enough $\tau > 0$ and $\mu \in \mathcal{P}_2(\mathbb{R}^d)$.

Indeed, after Remark 3.6 we know that $\mathcal{W}$ satisfies these hypotheses. Hence, we are in the
framework of [AGS, §10.3]. We will show how the results therein work for the case of $\mathcal{W}$.
First of all, we remark that Definition 2.2 is equivalent to the following one.
Figure 1. The construction of Example 6.5.



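Proposition 7.1. $\xi \in L^2(\mathbb{R}^d, \mu; \mathbb{R}^d)$ belongs to the Wasserstein subdifferential of $\mathcal{W}$ at $\mu \in \mathcal{P}_2(\mathbb{R}^d)$ if and only if

(7.3) $$\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \xi(x), y - x \rangle\,d\gamma(x, y) + o(C(\mu, \nu; \gamma))$$

as $\nu \to \mu$, for some optimal transport plan $\gamma \in \Gamma_o(\mu, \nu)$. Moreover, $\xi$ is a strong
subdifferential if and only if (7.3) holds whenever $\nu \to \mu$ in $\mathcal{P}_2(\mathbb{R}^d)$ and $\Gamma(\mu, \nu) \ni \gamma \to (\boldsymbol{i}, \boldsymbol{i})_\#\mu$
in $\mathcal{P}_2(\mathbb{R}^d \times \mathbb{R}^d)$.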

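Proof. Let $\gamma \in \Gamma(\mu, \nu)$ and define the interpolating curve $\theta^\gamma(t) = ((1 - t)\pi^1 + t\pi^2)_\#\gamma$ between
$\mu$ and $\nu$, so that $\theta^\gamma(0) = \mu$ and $\theta^\gamma(1) = \nu$. We take advantage of a property of Wasserstein
constant speed geodesics, shown in [AGS, Lemma 7.2.1]: there exists $\gamma^*$ in $\Gamma_o(\mu, \nu)$ such that
$\Gamma_o(\mu, \theta^{\gamma^*}(t))$ contains a unique element for any $t \in [0, 1)$, given by $\gamma_t := (\pi^1, (1 - t)\pi^1 + t\pi^2)_\#\gamma^*$.
Then, (7.3) can be applied in correspondence of $\gamma_t$ and with $\theta^{\gamma^*}(t)$ in place of $\nu$, and together
with the convexity property of $\mathcal{W}$ along interpolating curves (see Proposition 2.1), it gives, for $t \to 0$,

$$\begin{aligned} \mathcal{W}(\nu) - \mathcal{W}(\mu) &\ge \frac{\mathcal{W}(\theta^{\gamma^*}(t)) - \mathcal{W}(\mu)}{t} + \frac{\lambda}{2}(1 - t)\,C^2(\mu, \nu; \gamma^*) \\ &\ge \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \xi(x), y - x \rangle\,d\gamma^*(x, y) + \frac{\lambda}{2}(1 - t)\,C^2(\mu, \nu; \gamma^*) + \frac{1}{t}\,o\big(C(\mu, \theta^{\gamma^*}(t); \gamma_t)\big). \end{aligned}$$

Passing to the limit as $t \to 0$, since $C^2(\mu, \theta^{\gamma^*}(t); \gamma_t) = t^2 C^2(\mu, \nu; \gamma^*)$, one gets (2.3) for the
plan $\gamma^* \in \Gamma_o(\mu, \nu)$. One reasons in the same way for the equivalence in the case of strong
subdifferentials: indeed, one can define $\theta^\gamma(t)$ for the generic plan $\gamma \in \Gamma(\mu, \nu)$ and use the
convexity of $\mathcal{W}$ along any interpolating curve $t \mapsto \theta^\gamma(t)$ (see Proposition 2.1). □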

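On the other hand, the general definition of subdifferential, given in [AGS, §10.3], is more
technical. According to that notion, the subdifferential is in fact a plan $\boldsymbol{\beta} \in \mathcal{P}_2(\mathbb{R}^d \times \mathbb{R}^d)$, as
in the following

Definition 7.2 (Plan subdifferential). Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. We say that $\boldsymbol{\beta} \in \mathcal{P}_2(\mathbb{R}^d \times \mathbb{R}^d)$
belongs to the extended subdifferential of $\mathcal{W}$ at $\mu$ if $\pi^1_\#\boldsymbol{\beta} = \mu$ and there holds

(7.4) $$\mathcal{W}(\nu) - \mathcal{W}(\mu) \ge \int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} \langle y, z - x \rangle\,d\boldsymbol{\mu}(x, y, z) + \frac{\lambda}{2} \int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} |z - x|^2\,d\boldsymbol{\mu}(x, y, z)$$

for some $\boldsymbol{\mu} \in \Gamma_o(\boldsymbol{\beta}, \nu)$ (we obtain a strong subdifferential if the inequality holds for any plan
$\boldsymbol{\mu} \in \Gamma(\boldsymbol{\beta}, \nu)$). We write $\boldsymbol{\beta} \in \boldsymbol{\partial}\mathcal{W}(\mu)$ (resp. $\boldsymbol{\beta} \in \boldsymbol{\partial}_S\mathcal{W}(\mu)$). Here the elements of $\Gamma(\boldsymbol{\beta}, \nu)$ are
three-plans, that is, measures in $\mathcal{P}(\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d)$, such that $\pi^{1,2}_\#\boldsymbol{\mu} = \boldsymbol{\beta}$ and $\pi^3_\#\boldsymbol{\mu} = \nu$. The
definition of optimal plan in this case is

$$\Gamma_o(\boldsymbol{\beta}, \nu) := \{\boldsymbol{\gamma} \in \Gamma(\boldsymbol{\beta}, \nu) : \pi^{1,3}_\#\boldsymbol{\gamma} \in \Gamma_o(\mu, \nu)\}.$$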

Remark 7.3. Since $\mathcal{W}$ is convex along any linearly interpolating curve, as noticed in Proposition 2.1,
from [AGS, Theorem 10.3.6] we learn that we can equivalently define the extended
subdifferential of $\mathcal{W}$ by asking inequality (7.4) for any $\boldsymbol{\mu} \in \Gamma_o(\boldsymbol{\beta}, \nu)$.

Remark 7.4. We observe that if $\boldsymbol{\beta}$ is concentrated on the graph of a vector field $\xi$, we have
$\boldsymbol{\beta} = (\boldsymbol{i}, \xi)_\#\mu$ and in particular $y = \xi(x)$ for $\boldsymbol{\mu}$-a.e. $(x, y, z) \in \mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d$. In this case the
definition reduces to (2.3).



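We recall that, also for extended subdifferentials, there holds

$$|\partial\mathcal{W}|(\mu) = \min\left\{ \left( \int_{\mathbb{R}^d} |y|^2\,d\pi^2_\#\boldsymbol{\beta}(y) \right)^{1/2} : \boldsymbol{\beta} \in \boldsymbol{\partial}\mathcal{W}(\mu) \right\}.$$

Moreover, the corresponding minimizer is unique. See [AGS, Theorem 10.3.11]. We denote it
by $\boldsymbol{\partial}^\circ\mathcal{W}(\mu)$.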

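We have the following

Lemma 7.5. Let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$. The following assertions hold:

(7.5) $$\boldsymbol{\beta} \in \boldsymbol{\partial}_S\mathcal{W}(\mu) \Rightarrow \bar{\boldsymbol{\beta}} \in \partial_S\mathcal{W}(\mu) \quad \text{and} \quad \boldsymbol{\beta} \in \boldsymbol{\partial}\mathcal{W}(\mu) \Rightarrow \bar{\boldsymbol{\beta}} \in \partial\mathcal{W}(\mu).$$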

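Proof. We begin with the proof for strong subdifferentials. Let $\boldsymbol{\beta} \in \boldsymbol{\partial}_S\mathcal{W}(\mu)$ and write
$\boldsymbol{\beta} = \int_{\mathbb{R}^d} \boldsymbol{\beta}_x\,d\mu(x)$. For any $\gamma \in \Gamma(\mu, \nu)$ there holds

$$\begin{aligned} \int_{\mathbb{R}^d \times \mathbb{R}^d} \langle \bar{\boldsymbol{\beta}}(x), z - x \rangle\,d\gamma(x, z) &= \int_{\mathbb{R}^d \times \mathbb{R}^d} \Big\langle \int_{\mathbb{R}^d} y\,d\boldsymbol{\beta}_x(y), z - x \Big\rangle\,d\gamma(x, z) \\ &= \int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} \langle y, z - x \rangle\,d\boldsymbol{\beta}_x(y)\,d\gamma(x, z). \end{aligned}$$

Moreover, taking into account that $\int_{\mathbb{R}^d} d\boldsymbol{\beta}_x(y) = 1$ for $\mu$-a.e. $x \in \mathbb{R}^d$, we have that

$$\int_{\mathbb{R}^d \times \mathbb{R}^d} |z - x|^2\,d\gamma(x, z) = \int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} |z - x|^2\,d\boldsymbol{\beta}_x(y)\,d\gamma(x, z).$$

Let us define the three-plan $\boldsymbol{\mu}$ as

$$\int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} \varphi(x, y, z)\,d\boldsymbol{\mu}(x, y, z) := \int_{\mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d} \varphi(x, y, z)\,d\boldsymbol{\beta}_x(y)\,d\gamma(x, z)$$

for all continuous functions $\varphi : \mathbb{R}^d \times \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$ with at most quadratic growth at infinity.
Then, $\boldsymbol{\mu}$ belongs to $\Gamma(\boldsymbol{\beta}, \nu)$. Making use of (7.4) for this particular choice of $\boldsymbol{\mu} \in \Gamma(\boldsymbol{\beta}, \nu)$, we
see that $\bar{\boldsymbol{\beta}}$ satisfies (2.3). Recalling Remark 7.3, since $\gamma \in \Gamma_o(\mu, \nu) \Rightarrow \boldsymbol{\mu} \in \Gamma_o(\boldsymbol{\beta}, \nu)$, reasoning
as done for strong subdifferentials the implication

$$\boldsymbol{\beta} \in \boldsymbol{\partial}\mathcal{W}(\mu) \Rightarrow \bar{\boldsymbol{\beta}} \in \partial\mathcal{W}(\mu)$$

follows. □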

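Corollary 7.6. If $\boldsymbol{\beta}^0$ is the minimal selection in the extended subdifferential of $\mathcal{W}$ at $\mu$, then
it coincides with $(\boldsymbol{i}, \bar{\boldsymbol{\beta}}^0)_\#\mu$, and $\bar{\boldsymbol{\beta}}^0$ is the minimal selection in $\partial\mathcal{W}(\mu)$.

Proof. For any $\boldsymbol{\beta} \in \boldsymbol{\partial}\mathcal{W}(\mu)$, using Jensen's inequality, there holds

$$\int_{\mathbb{R}^d} |y|^2\,d\pi^2_\#\boldsymbol{\beta}(y) = \int_{\mathbb{R}^d \times \mathbb{R}^d} |y|^2\,d\boldsymbol{\beta}(x, y) = \int_{\mathbb{R}^d} \int_{\mathbb{R}^d} |y|^2\,d\boldsymbol{\beta}_x(y)\,d\mu(x) \ge \int_{\mathbb{R}^d} |\bar{\boldsymbol{\beta}}(x)|^2\,d\mu(x).$$

This shows that the barycentric projection does not increase the norm. Therefore, if $\boldsymbol{\beta}^0$ is the
minimal selection, since $\bar{\boldsymbol{\beta}}^0 \in \partial\mathcal{W}(\mu)$, there has to hold $\boldsymbol{\beta}^0 = (\boldsymbol{i}, \bar{\boldsymbol{\beta}}^0)_\#\mu$. In this case it is also
clear that $\bar{\boldsymbol{\beta}}^0 = \partial^\circ\mathcal{W}(\mu)$. □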
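
The Jensen step in the proof of Corollary 7.6 can be illustrated on a discrete plan. The sketch below is only a toy example under stated assumptions: plans are represented as hypothetical dictionaries `{(x, y): weight}` on $\mathbb{R} \times \mathbb{R}$, and the function names are ours. It compares the second moment of the second marginal with the squared $L^2(\mu)$ norm of the barycenter.

```python
from collections import defaultdict

def barycenter(plan):
    """Barycentric projection of a discrete plan {(x, y): weight}.

    Returns (mu, beta_bar), where mu[x] is the first marginal and
    beta_bar[x] is the mean of y under the conditional measure at x.
    """
    mu = defaultdict(float)
    first_moment = defaultdict(float)
    for (x, y), w in plan.items():
        mu[x] += w
        first_moment[x] += w * y
    beta_bar = {x: first_moment[x] / mu[x] for x in mu}
    return dict(mu), beta_bar

def plan_norm_sq(plan):
    """Second moment of the second marginal: sum of |y|^2 times the weight."""
    return sum(w * y * y for (_, y), w in plan.items())

def barycenter_norm_sq(plan):
    """Squared L^2(mu) norm of the barycenter of the plan."""
    mu, beta_bar = barycenter(plan)
    return sum(mu[x] * beta_bar[x] ** 2 for x in mu)

# Hypothetical discrete plan (weights sum to 1) that is not concentrated on a graph.
plan = {(0.0, -1.0): 0.25, (0.0, 1.0): 0.25, (1.0, 2.0): 0.3, (1.0, 3.0): 0.2}
# Hypothetical plan concentrated on the graph of a map: equality is expected.
graph_plan = {(0.0, 1.0): 0.5, (1.0, -2.0): 0.5}
print(plan_norm_sq(plan), barycenter_norm_sq(plan))
print(plan_norm_sq(graph_plan), barycenter_norm_sq(graph_plan))
```

The inequality is strict for the first plan and becomes an equality for the plan concentrated on a graph, which is the mechanism behind the identification of the minimal selection.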

Remark 7.7. Because of (7.5), under the same assumptions of Theorem 3.2 one sees that if
$\boldsymbol{\beta} \in \boldsymbol{\partial}_S\mathcal{W}(\mu)$, then its barycenter $\bar{\boldsymbol{\beta}}$ is given by (3.3).

Remark 7.8. In the case of strong subdifferentials, the implication of Lemma 7.5 is true for
any functional $\Phi$ satisfying the assumptions (7.1) and (7.2). It is shown in Lemma 10.3.4 and
Remark 10.3.5 of [AGS] that, given $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, a minimizer $\mu_\tau$ to $\Phi(\cdot) + \frac{1}{2\tau} d_W^2(\cdot, \mu)$ and a
plan $\hat{\boldsymbol{\gamma}}_\tau \in \Gamma_o(\mu_\tau, \mu)$, there holds

$$\boldsymbol{\gamma}_\tau \in \boldsymbol{\partial}_S\Phi(\mu_\tau),$$

where $\boldsymbol{\gamma}_\tau$ is the rescaled of $\hat{\boldsymbol{\gamma}}_\tau$ (see Definition 3.12). Moreover, among these rescaled plans,
there exists a plan whose barycenter belongs to $\partial_S\Phi(\mu_\tau)$. After Lemma 7.5 we may indeed
infer that this holds true for the rescaled of any optimal plan in $\Gamma_o(\mu_\tau, \mu)$.

Finally, we are ready for the proof of Proposition 3.13. We make use of the general
convergence properties of rescaled plan subdifferentials shown in [AGS, Theorem 10.3.10],
passing to barycenters by means of Lemma 7.5 and Corollary 7.6.



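Proof of Proposition 3.13. Let $\tau > 0$ be small enough. Once more, let $\mu \in \mathcal{P}_2(\mathbb{R}^d)$, let
$\mu_\tau$ minimize $\mathcal{W}(\cdot) + \frac{1}{2\tau} d_W^2(\cdot, \mu)$ and let $\hat{\boldsymbol{\gamma}}_\tau \in \Gamma_o(\mu_\tau, \mu)$. Moreover, let $\boldsymbol{\gamma}_\tau$ be the rescaled of $\hat{\boldsymbol{\gamma}}_\tau$
(as given by Definition 3.12). As a consequence of Theorem 3.1, for any $\mu \in \mathcal{P}_2(\mathbb{R}^d)$ the set
$\partial\mathcal{W}(\mu)$ is not empty. Therefore we are in the hypotheses of [AGS, Theorem 10.3.10], which
entails, taking into account also [AGS, Remark 10.3.14],

$$\lim_{\tau \to 0} \boldsymbol{\gamma}_\tau = \boldsymbol{\partial}^\circ\mathcal{W}(\mu) \quad \text{in } \mathcal{P}_2(\mathbb{R}^d \times \mathbb{R}^d).$$

But Corollary 7.6 implies that $\boldsymbol{\partial}^\circ\mathcal{W}(\mu) = (\boldsymbol{i}, \partial^\circ\mathcal{W}(\mu))_\#\mu$. The convergence above then means
that, as $\tau \to 0$,

(7.6) $$\int_{\mathbb{R}^d \times \mathbb{R}^d} \varphi(x, y)\,d(\boldsymbol{\gamma}_\tau)_x(y)\,d\mu_\tau(x) \to \int_{\mathbb{R}^d \times \mathbb{R}^d} \varphi(x, y)\,d(\boldsymbol{i}, \partial^\circ\mathcal{W}(\mu))_\#\mu(x, y)$$

for any continuous function $\varphi : \mathbb{R}^d \times \mathbb{R}^d \to \mathbb{R}$ with at most quadratic growth at infinity, where
$(\boldsymbol{\gamma}_\tau)_x$ denotes the family of measures which disintegrates $\boldsymbol{\gamma}_\tau$ with respect to $\mu_\tau$. Letting
$\zeta \in C_c^\infty(\mathbb{R}^d; \mathbb{R}^d)$, and choosing $\varphi(x, y) = \langle y, \zeta(x) \rangle$ in (7.6), we obtain the convergence in the
sense of Definition 3.8 of $\bar{\boldsymbol{\gamma}}_\tau$ to $\partial^\circ\mathcal{W}(\mu)$. On the other hand, using Jensen's inequality (in the
same way as in the proof of Corollary 7.6) and (7.6) with $\varphi(x, y) = |y|^2$ we obtain

$$\limsup_{\tau \to 0} \int_{\mathbb{R}^d} |\bar{\boldsymbol{\gamma}}_\tau|^2\,d\mu_\tau \le \lim_{\tau \to 0} \int_{\mathbb{R}^d \times \mathbb{R}^d} |y|^2\,d\boldsymbol{\gamma}_\tau = \int_{\mathbb{R}^d \times \mathbb{R}^d} |y|^2\,d(\boldsymbol{i}, \partial^\circ\mathcal{W}(\mu))_\#\mu = \int_{\mathbb{R}^d} |\partial^\circ\mathcal{W}(\mu)|^2\,d\mu,$$

hence we also have the strong convergence in the sense of Definition 3.8. □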